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Abstract 

We study multi-parameter Carnot-Caratheodory balls, generalizing re- 
sults due to Nagel, Stein, and Wainger in the single parameter setting. 
The main technical result is seen as a uniform version of the theorem of 
Frobenius. In addition, we study maximal functions associated to certain 
multi-parameter families of Carnot-Caratheodory balls. 



1 Introduction 

In the seminal paper [NS W85 , Nagel, Stein, and Wainger gave a detailed study 
of Carnot-Caratheodory balls. The main purpose of this paper is to develop 
an analogous theory of multi-parameter Carnot-Caratheodory balls: a situation 
where the methods of NS W85] do not apply in general. We will see that the 
main results for multi-parameter Carnot-Caratheodory balls follow from a cer- 
tain "uniform" version of the theorem of Frobenius on involutive distributions 
We will prove this version of the theorem of Frobenius by building on the work 
of [NSW85j along with work of Tao and Wright |TW03j . Our primary moti- 
vation is to obtain the properties of multi-parameter balls which are relevant 
for developing a theory of multi-parameter singular integrals, which will be the 
subject of a future paper. To this end, we will estimate the volume of certain 
multi-parameter balls, and we will study maximal functions associated to certain 
families of multi-parameter balls. In addition, we will study the composition of 
certain "unit operators." 

We begin by introducing the notion of a Carnot-Caratheodory ball. Suppose 
we are given q C 1 vector fields, Xi , . . . , X q on an open set f2 C M n ; denote this 
list of vector fields by X. We define the Carnot-Caratheodory ball of unit radius, 

1 Here, and in the rest of the paper, we are considering (possibly) singular distributions. 
That is, the dimension of the distribution may vary from point to point. 
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centered at G O, with respect to the list X byH 



B x (xo) ■= < v e ft 



3 7 : [0,1] -> fi, 7 (0) = x , 7 (1)= y, 

i (t) = £ Orf («) ^ (7 («)) , G ([0, 1]) , 




< 1 



£°°([0,1]) 

Now that we have the definition for Carnot-Caratheodory balls with unit radius, 
we may define Carnot-Caratheodory balls of any radius merely by scaling the 
vector fields. This leads us directly to multi-parameter balls, of which the single 
parameter balls of [NSW85 are a special case. 

Fix v > 1, an integer. We will discuss ^-parameter balls. To each vector 
field Xj, we associate a formal degree ^ dj £ [0,00)". We denote by (X,d) 
the list of vector fields 

(x 1 ,d 1 ),...Ax q ,d q ). 

Furthermore, for 5 £ [0,oo) 1 ', we denote by 5 d X the list of vector fields: 

8 dl X 1 ,...,5 d *X q 

where S dj is defined by the standard multi-index notation. That is, S dj — 
riu=i ■ Then we define the multi-parameter Carnot-Caratheodory ball cen- 



tered at xq £ fl of radius S by: 



B(x,d) (xo,S) := B S d X (x ) . 



The theory in [NSW85 concerns the case when v = 1 (see Section 11.2.11 for a 
discussion of their results). One of the main goals of this paper is to develop 
appropriate conditions on the list (X, d) to allow for a general theory of such 
multi-parameter balls. 

It has long been understood that singular integrals corresponding to the 
single parameter balls of [NSW85 play a fundamental role in many questions in 
the regularity of linear partial differential operators that are defined by vector 
fields; in particular, they arise in many questions in several complex variables. 
This began in [FS741 [RS76] . and was followed by [SC841 IFSC861 IJSC87] . These 
works were followed by many others; too many to offer a detailed account here. 
In the area of several complex variables, some examples are |Chr88| INRSW891 
CNS92, Koc02 . Recently, however, multi-parameter singular integrals, where 
the underlying geometries are non-Euclidean, have been shown to arise in various 
special cases in several complex variables and in the parametricies for certain 
linear partial differential operators. Moreover, these examples are not even 



2 Here, and in the rest of the paper, we write 7' (i) = Z (t) to mean 7 (t) = 7 (0)+ f Z (s) ds. 
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amenable to the usual product theory of singular integrals (as is covered in, for 
example, [NS04 ): the geometries overlap in a non-trivial way. In this vein see 
[MRS95I INRS011 INS061 IStr08] . It is our hope that this paper will help play a 
role in unlocking more general theories. 

1.1 Informal statement of results and outline of the paper 

In this section, we offer a brief overview of some of the key results of the paper. 
One of the main aspects of the proofs, and of the interrelationships between the 
results, is keeping careful track of parameters the constants in the results depend 
on. This makes the rigorous formulation of these results somewhat technical. 
Because of this, in this section, we state the results only in the C°° category 
(while we will later deal with less smoothness) and are not precise about what 
parameters the constants depend on. After each result we will refer the reader 
to the part of the paper which contains the precise formulation of the result. In 
addition, Theorem 11.21 represents only a special case of the main result of the 
paper (Theorem 15.31) . 

Before we begin, we need a few pieces of notation. Given two integers 1 < 
m < n, we let I (m, n) be the set of all lists of integers . . . , i m ), such that: 

1 < h < «2 < • ■ ■ < 4 < n. 

Furthermore, suppose A is an n x q matrix, and suppose 1 < n < n A q, for 
/ G I(rio,n), J G I(rio,g) define the tiq x uq matrix Ai j by using the rows 
from A which are listed in / and the columns of A which are listed in J. We 
define: 

det A = (det A/,j)/ eI (n ,n) • 
noXno Jez\n , q ) 

In particular, det„ oX „ A is a vector. It will not be important to us in which 

order the coordinates are arranged. For further information on this object, see 

Appendix [Bj 

For a vector » e I", we write \v\ for the usual i 2 norm, and and \v\ 1 
for the £°° and i 1 norms, respectively. For a matrix A, we write \\A\\ for the 
usual operator norm. Finally, we write B n (ij) for the ball in K™, centered at 0, 
of radius 77 > in the | • | norm. 

The setup of the main result is as follows. We are given q C°° vector fields 
Xi, . . . , X q defined on a fixed open set S1CI". Corresponding to each vector 
field we are given a formal degree 7^ dj G [0,00)", where v is a fixed posi- 
tive integer. We let (X, d) denote the list of vector fields with formal degrees 
(Xi,d%) , . . . , {X q , d q ), and we let X denote the list of vector fields X%, . . . , X q . 
At times, we will identify X with the n x q matrix whose columns are given by 
Xi , . . . , Xq (similarly for other lists of vector fields) . Our main assumption is 
that for every 5 G [0, l) 1 , with \S\ sufficiently smallo we have: 

[5^X^X k ] =X>$5***<- ( L1 ) 
1=1 

3 Throughout the rest of this introduction, 8 will always denote a small clement of [0, 1)". 
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We assume that c l ^ k £ C°° uniformly in 8; i.e., that as 8 varies, c l - 5 k varies over 
a bounded subset of C°° 

Remark 1.1. Note that we have not assumed that the list of vector fields X 
spans the tangent space. This will prove the be an essential point in much of 
what follows. One thing to observe is that while X may not span the tangent 
space, (jl.ll) implies that the distribution spanned by 8 d X is involutive, and 
therefore the classical theorem of Frobenius applies to show that these vector 
fields foliate into leaves (see Section [I.2.5l for a review of the classical theorem 
of Frobenius). The Carnot-Caratheodory ball Br x ,d) { x o>$) is then an open 
subset of the leaf passing through Xq generated by this distribution. In what 
follows, we will estimate the volume of this ball (denoted by Vol (i?(x,d) (^o, <5)) )■ 
This volume is taken in the sense of the induced Lebesgue measure on the leaf. 

For uq < q and J = . . . ,j no ) € Z(no,q), we write (X,d)j to denote 
the list of vector fields with formal degrees {Xj 1 , dj 1 ) , . . . , {-Xj , dj n ) , and we 
write Xj to denote the list of vector fields Xj 1 , . . . , Xj , similarly we write 
dj for the list of formal degrees dj 1 , . . . , dj n . For each x € ft, let no (x, 8) — 
dimspan{(5 dl Xi (x) , . . . , 8 dq X q (x)} and for each x € Q, and 8 sufficiently 
small pick J {x, 8) £ T (hq (x, 8) , q) such that: 

det S d ^X J{x , s) (x) 
For u € JJ"o( :c >' 5 ) with \u\ sufficiently small, define the map|f| 

x. I \ U-(5 d X) u .X d J(x,6) X ,, 

<& xS (u) = e v >J(*,6)x = e *j<.x,s) x _ 
Our main theorem is: 

Theorem 1.2. Let K be a compact subset of £1. Then, there exist constants 
rj, £ ~ 1 such that for all 8 sufficiently small and all x £ K : 

B (x,d) (x,^8) C $ XyS (B no{X}5) (n)) C B (x ,d) (x,8) 

and 

1. $ x ,5 : B„ o(l ^) (n) B {x ,d) {xq,8) is one-to-one. 

2. ForalluE B„ ( x ,a) (/?), \det na{x j)xn (x,5)d$ x ,s(u)\ « \det no{x ^) X n (x,S) S d X (x) 

3. Vol (-B(x.d) (^j £)) ~ |det„ ( 2 . !( 5) Xno ( ;1 , ,5) 8 d X (x) \ . This is essentially a con- 
sequence of items [7] and\^ 

4 Even in the smooth case, the assumptions in Section 15 . 1 1 require less than we outline here. 

5 Note, the dependence of no {x,8) on <5 only involves which of the coordinates of 8 are 0. 

6 If Z is a C 1 vector field, then e z x is defined in the following way. Let E (t) be the 
unique solution to the ODE -^E (t) = Z (E (i)), E (0) = x. Then, e z x is defined to be E (1), 
provided this solution exists up to f = 1 (which it will if Z has sufficiently small C 1 norm). 
See Appendix lAl for further details. 



dot 8 d X (x) 

no(x,8) xno(x,S) 
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4- Vol (B(x.d) ( x > 25)) < Vol \Bix,d) { x t 5)) ■ This is essentially a consequence 
ofItem\M 



In addition to what is stated in Theorem 11.21 a number of other technical 
results hold which are essential for applications. In particular, the map Q> Xt g can 
be used as a "scaling" map. This is because the pullback of the vector fields 
8 d X via the map &x.5 to B no ^ x §^ (77) satisfy good properties uniformly in x and 
50 We refer the reader to Section [5] for a discussion of these results along with 
the rigorous statement of Theorem 1 1.2 1 Note that in the single parameter case, 
Item E] is the main inequality that must be satisfied for the balls B^ x ,d) (%, 5) to 
form a space of homogeneous type (when paired with Lebesgue measure). This 
is the first sign that these multi-parameter balls, in this generality, will yield 
analogs to some results from the single-parameter Calderon-Zygmund theory. 

As was mentioned earlier, Theorem 1 1.2 1 will follow from a "uniform" version 
of the theorem of Frobenius. To understand this connection, one must first 
understand the connection between multi-parameter balls and single parameter 
balls. Given the multi-parameter formal degrees 7^ dj € [0, 00)", we obtain 
corresponding single parameter degrees, which we denote by ^ d, and are de- 
fined by (%2d)j := J>2u=idj — \dj\ v Given 5, we decompose 8 — 8q6\, where 
5 € [0,oo) and 8\ G [0, oo) 1/ @ Then, directly from the definition, we obtain: 

B (X,d) ( x o, <*) = B (sfX,J2 d) ( x o, s o) = b (s"x,j: d) i x o, 1) • 

Because of this, to prove Theorem 1 1.2 1 for a fixed x £ K and a fixed 8, it suffices 
to prove a result for a list of vector fields with single-parameter formal degrees: 
the vector fields (S d X, £ d) . 

At this point, we change notation. We now work in the single-parameter case 
v = 1. We suppose we are given q C°° vector fields X\, . . , , X q on a fixed open 
set fi C M. n and associated to each Xj we are given a formal degree dj € (0, 00). 
We further suppose that we are given a fixed point xq £ One should think 
of this single-parameter list (X, d) as coming from a multi-parameter list via 
(8 d X, 53 d) as m Theorem 11.21 Our main assumption is that we have: 

1 

[Xj,X k }=J24,k x i (1-2) 



1=1 



where c l j k G C*°°. 

Let no — dimspan{Xi (xq) , . . . , X q (xq)}, and pick J G T (no, q) such that: 



det Xj (x ] 

n xn 



det X (xo) 

n xn 



For u G IR ra ° with |u| sufficiently small, define the map 

$ (u) = e u - Xj x . 



7 See Section l5.2.4l to sec a scaling technique in action. 
8 Of course this decomposition is not unique. 

9 In addition, we need to assume that xq is not too close to the boundary of Q, but we 
ignore such technicalities in this introduction. 
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In what follows, the constants can be chosen uniformly as the Xj and Cj k vary 
over bounded subsets of C°°. The constants do not depend on a lower bound 
for, say, |det noX „ X {xq)\. Our "uniform" version of the theorem of Frobenius 
is: 

Theorem 1.3. There exist 77, £ « 1 such that: 

B { xa) (*o, C $ {B no (77)) C B (X4) (x , 1) 

and 

• $ : S no (77) -» (zo, 1) is one-to-one. 

• for a/Z u G -B„ (77), |det„ oX „ d$ « |det„ oX „ X (x )\. 

Furthermore, if we let Yj be the pullback of Xj via the map <J>, then the list of 
vector fields Yy , . . . , Y q satisfy good estimates. See Theorem for details. 

Let us now describe why Theorem 11.31 can be viewed as a version of the 
theorem of Frobenius (see Section H.2.51 for further discussion on this point). 
Indeed, our main assumption (|1.2[) is exactly the main assumption of the theo- 
rem of Frobenius. Hence under the hypotheses of Theorem 1 1.31 the vector fields 
Xi, . . . , X q foliate f2 into leaves. As mentioned in Remark 1 1.11 B^ x ,d) { x o>0 IS 
an open neighborhood of xq on this leaf. Moreover $ : B no (n) — > B^ x ,d) ( x o, 1) 
is one-to-one. Thus, $ can be considered as a coordinate chart on the leaf in a 
neighborhood of Xq. Hence for each point xq G O, Theorem 11.31 yields a coor- 
dinate chart near xo on the leaf passing through xq . In this way, Theorem 11.31 
implies the classical theorem of Frobenius. The main point is that not only does 
Theorem 11.31 yield a coordinate chart, but it also allows one to take f , 77 « 1 
and it gives good estimates on this coordinate chart; estimates which do not 
follow from the standard proofs of the theorem of Frobenius (see Remark |3.4[) . 
nor from the methods of [NS W85] (see the discussion in Section [1.2.1j) . 

In Section 11.21 we discuss a number of previous, related works, and relate 
our results to these works. In Section [3] we state and prove a precise version of 
Theorem ll.3l in the special case when X\ (xq) X q (xq) are linearly indepen- 
dent, and it is in this section that the main technicalities of the paper lie. We 
refer to this result as a uniform theorem of Frobenius. The proof heavily uses 
methods from Section 4 of [TW03] and methods from [NS W85 , but these need 
to be significantly generalized to adapt them to our situation. In Section U we 
use the results of Section [3] to prove the more general version of Theorem 11.31 
in the case when X\ (xq) , . . . , X q (xq) are not necessarily linearly independent. 
We refer to this as studying Carnot-Caratheodory balls "at the unit scale. 
In addition, we use these results to define smooth bump functions supported 
on these balls, along the lines of those used in [NS01 . In Section [5] we state 

10 Here we mean at the unit scale with respect to the vector fields Xj. Thus, if the Xj are 
very small (as is the case when Xj = 5 d iWj, where S is small), then one can think of it as 
being at a small scale. In addition, we could have equally well referred to this as a theorem 
of Frobenius. We chose this name, though, to emphasize its role in the proof of Theorcm ll,2l 
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and prove the rigorous version of Theorem 1 1.2 1 In Section [51 we use the results 
from Section 14.21 to study multi-parameter maximal functions associated to a 
certain subclass of our multi-parameter balls. In Section [5] we also discuss com- 
positions of certain unit operators (see Corollary 16.81 and Section ll.2.4[) . and 
in Section 16.11 we use these unit operators to discuss the relationship between 
certain quasi-metrics that arise. 

From the discussion proceeding Theorem II .31 it is clear why Theorem 11.31 
implies Theorem 11.21 provided one has appropriate control over the implicit 
constants in Theorem 11.31 Hence a main aspect of this paper is to keep track 
of the appropriate constants in Theorem 11.31 At times, this will be quite tech- 
nical. In addition, we will state our main results with only a finite amount of 
smoothness, further complicating our notations0 In the past, there has been 
some interest in results in the single parameter case, using as low regularity 
as possible. Even in this single parameter context, our results are new in this 
direction. See Section fl. 2.31 for a discussion of this. 

In an effort to ease the notation in the paper, at the start of many of the 
sections of this paper, we will define a notion of "admissible constants." These 
will be constants that only depend on certain parameters. This notion of ad- 
missible constant may change from section to section, but we will be explicit 
about what it means each time. In addition, if k is another parameter, and 
we say "there exists an admissible constant C — C (k)," we mean that C is 
allowed to depend on everything an admissible constant may depend on, and is 
also allowed to depend on k. We use the notation A < B to mean A < CB, 
where C is an admissible constant; so that, in particular, the meaning of < may 
change from section to section. We use A s» B to mean A < B and B < A. In 
some sections, we will use different levels of smoothness assumptions. In these 
sections, we will also dehne a notion of m-admissible constants, where m € N 
denotes the level of assumed smoothness. We will write A < m B for A < CB, 
where C is an m-admissible constant, and we dehne « m in a similar manner. 

We write Q n (77) to denote the unit ball in R", centered at 0, of radius 77 in 
the |-| norm. All functions in this paper are assumed to be real valued. Given 
a, possibly not closed, set U C R n , we write: 



Finally, v\,V2 G K m are two vectors, we write v\ < i>2 to mean that the inequal- 
ity holds for each coordinate. 

Remark 1.4. Throughout the paper we work on an open subset fi C K™, en- 
dowed with Lebesgue measure. At hrst glance, it might seem useful to work more 
generally on a Riemannian manifold (where Lebesgue measure is replaced by the 
volume element); and replace the set of vector fields X\, . . . ,X q with a locally 
finitely generated distribution (endowed with an appropriate (multi-parameter) 
filtration taking the place of the formal degrees). However, our results are local 

11 While it does complicate notation, working with only a explicit finite amount of smooth- 
ness does not complicate our proof. 




|a|<m 
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in nature, and working in such a setting offers no new generality and only serves 
to complicate notation. 



1.2 Past work 

In this section we discuss other results from the literature which are related to 
the results in this paper. In particular, we discuss the work of |NSW85j and 
the work in Section 4 of [TW03] . Next, we discuss other results concerning 
Carnot-Caratheodory balls in the case when the vector fields are not smooth. 
In particular, we discuss the recent works BBP08] and |MM08) . Third, as 
motivation for our study of "unit operators" (and maximal functions) in Section 
[H we discuss the singular integrals from |Str08) . Finally, we discuss the classical 
theorem of Frobenius and make some further remarks on how Theorem II .31 can 
be seen as a "uniform" version. 



1.2.1 Single-parameter balls and the work of Nagel, Stein, and Wainger 

In this section, we discuss the main results of |NSW85| . In fact, their main 
results can be seen as a special case of Theorem 11.21 in the single-parameter 
case {v — 1). 

We are given an open set fi C W 1 and C°° vector fields X\, . . . , X q on fi, 
with corresponding formal degrees di, . . . , d q € (0, oo). [NSW85] assumes two 
properties of the vector fields and formal degrees: 

1. There exist cf , G C°° such that 

[x u xj]= J2 c h Xk - (1-3) 

d k <di+dj 



2. The vector fields X±, . . . , X„ span the tangent space at every point. 



In this context, Nagel, Stein, and Wainger prove Theorcm ll.2l ffor a fixed com- 
pact set K € Q). Note that Item [T] is a special case of (|l.ip . Indeed, one may 
take 

otherwise. 



k,S 



The implicit constants are allowed to depend not only on upper bounds for a 
finite number of the C m norms of the Xj and the c* • (as in Theorem II. 2p . but 
also a lower bound for: 



inf 



dct X (x) 



(1.4) 



This is the fundamental difference between the results of [NSW85 and Theorem 

Indeed, using the connection between single-parameter and multi-parameter 
balls discussed in Section [TTT1 it is not hard to see that Theorem ll.2l is essentially 
equivalent to obtaining the results of [NSW85 without allowing the constants 
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to depend on a lower bound for (| 1 .4[) . Of course, if one does not allow the 
constants to depend on a lower bound for (jl.4p , one should also consider the 
limiting result when the quantity in (|1.4[) is equals 0. I.e., when the vector fields 
do not span the tangent space at every point. This is precisely the statement 
of Theorem II .21 in the single parameter case. 

Use of a lower bound for (|1.4j) is essential to the methods of |NSW85| . It is 
used, for instance, every time the error term in the Campbell-Hausdorff formula 
is estimated^ To explain this, we outline a proof of (a result similar to) Lemma 
2.13 of |NSW85j . We take the setting as above, and consider the map (B q (rj) — > 
17, for some small rj > 0): 

Then one has: 

d0 s (d Sj ) =J2cf s S d <°X k , 

3 = 1 

with c k j' S bounded uniformly for 8 > small. Indeed, the Campbell-Hausdorff 
formula allows one to compute the Taylor series for dOg (d Sj ). One has, for every 
N > 0, 

dO s (d Sj ) =5 d iXj +ai[s- 6 d X, 5 d *Xj] + a 2 [s- S d X, [s ■ S d X, 6 d *Xj]] + ... 
+ ajv-i {commutators of order N — 1} + O (\S d s\ N J , 

where the dj are constants and 5 d s = (5 dl si, . . . , S dq s q ). The first N terms 
are of the desired form by (| 1 . 3|) (or more generally, Thus, the goal is 

to see that O (\S d s\ N ^J is of the desired form. This can be seen directly, by 

taking N so large that N miiij {dj} > maxj {dj}, and using the lower bound for 
(|1.4[) . However, this procedure does not work in the multi-parameter situation. 
Indeed, consider the two-parameter situation. In the case when 5i << 62, then 

the best one can say about the error term O (\S d s\ N ^ is that it is bounded by 

as large a power of 82 as we like (by taking TV large). However, we would need 
it to be bounded by a large power of 5\ to generalize the above proof. It turns 
out that, even in the multi-parameter situation, the error term is of the desired 
form. This follows a fortiori from the results of this paper. Because of this, 
one can use the results of this paper to apply the proofs in [NSW85j to the 
multi-parameter situation. However, since the results in |NSW85j follow from 
the results in this paper, this idea does not improve the main results of this 
paper. This idea does have some uses, though: one can often "lift" results from 
the single-parameter setting to the multi-parameter setting by using the results 
from this paper. This is discussed in more detail in Section [5.2.41 

12 See the appendix of NSW85 for an introduction to the Campbell-Hausdorff formula. 
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At first glance, one might think that the proper generalization of f| 1 . 3[) to 
the multi-parameter situation would be: 



[Xi,Xj]= ]T c* tj X k , (1.5) 

d k <di+dj 

where dj G [0, oo) and the inequality dk < di + dj is meant coordinatewise. 
Just as before this is a special case of (jl.ip , and one may take 



c k,8 = I S d ' +d i ~ d * cl 3 if d k < di + dj , 

'0 otherwise. 



However, unlike in the single-parameter case, (|1.5[l does not encapsulate a large 
fraction of the interesting examples. This is explained in more detail in Section 



1.2.2 Weakly comparable balls, the work of Tao and Wright, and a 
motivating example 

In this section, we discuss the work in Section 4 of |TW03I on "weakly-comparable' 
Carnot-Caratheodory balls. While the results discussed in this section do not 
follow from Theorem[L2j they do follow from the more general Theorem l5 . 31 -this 
is discussed in Section [5.2.11 

To understand these results, we must first understand the main motivat- 
ing example of |NSW85] . Suppose we are given C°° vector fields W% , . . . , W r 
on an open subset C M™. Suppose further that these vector fields satisfy 
Hormander's condition: i.e., Wi, . . . ,W r along with their commutators up to 
some fixed finite order (say, up to order m S N) span the tangent space at every 
point. We assign to each vector field Wj the formal degree 1. We assign to 
each commutator [Wi,Wj] the formal degree 2. We assign to each commuta- 
tor [Wi, [Wj,Wk\] the formal degree 3. We continue this process up to degree 
m, and we obtain a list of vector fields with one parameter formal degrees 
(Xi, d%) , . . . , (X q , d q ). As usual, we denote this list by (X, d). It is easy to 
check that this list of vector fields satisfies the assumptions in Section 11.2.11 It 
is also shown in |NSW85] that the single-parameter balls 

Bswi,...,sw r ( x o) 
are comparable to the single-parameter balls 

B (X4) ( x o,S) ■ 

Because of this, one can use the results of |NSW85j to study the balls 

Bsw lt ...,sw r ( x o) ■ 

We now turn to discussing two-parameter weakly-comparable balls. The re- 
striction to two-parameters is not essential, see Section [5. 2 .11 We suppose again 
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that we are given a list of C°° vector fields satisfying Hormander's condition, 
Wi, . . . , W r . We now separate this list into two lists: 

W[,...,W' rv W?,...,W' r ' 2 , 

so that the two lists together satisfy Hormander's condition, but they may not 
satisfy Hormander's condition separately. Suppose we wish to study the two- 
parameter balls given by 

BsxW'MW" ( x o) ) 

where 61,62 G (Oil) are small. Thus, when 61 — 62 this reduces to the single- 
parameter case discussed above. It is natural to wish for an estimate of the 
form 

Vol (B 2 s 1 w',2S 2 w" (xq)) < Vol (B SlW ',6 2 W" (xo)) ■ (1-6) 

Unfortunately, (|1.6I) does not hold in general (See Example 15. 6p . To obtain 
(|1.6[) . there are three options: 

1. We could restrict the vector fields we consider. This is the perspective 
taken up in Theorem II .21 

2. We could restrict the form of 6 = (61,62)- This is the perspective taken 
up in Section 4 of |TW03j . 

3. We could, more generally, do a combination of the above two methods. 
This is taken up in Theorem 15.31 

We briefly discuss the first two methods, and refer the reader to Theorem 15.31 
for the third. 

The first method is quite straight forward given the Theorem [OJ We assign 
to each of the vector fields W[, . . . ,W' the formal degree (1,0). We assign 
to each of the vector fields W", . . . , W" the formal degree (0, 1). If we have 
assigned a vector field Z\ the formal degree d\ and Z 2 the formal degree d%, 
we assign to the commutator [Z\, Z 2 ] the formal degree di + d 2 . One then uses 
this procedure to take commutators of the W and W" to some large finite 
order, thereby yielding a list of vector fields with two-parameter formal degrees 
(X\,d\) , . . . , (X q , d q ). The restriction we put on the vector fields W and W" 
is merely that this list of vector fields satisfies the conditions of Theorem ll.2F^I 
One can then apply Theorem ll.2l to study these balls. 

For the second method, Tao and Wright noted that one does not have to 
restrict the vector fields one considers, provided one restricts attention to 5 
which are "weakly-comparable." To define this notion, fix large constants k, N . 
We then restrict our attention to <5 = (Si, 62) that satisfy: 

-61 <6 2 < nSf. 

K 

13 It is a consequence of the results in Section l5,3l that the balls one obtains in this manner 
are essentially independent of the order of commutators one takes. That is, if the vector 
fields (X, d) were obtained by taking commutators up to order M, and if (X, d) satisfies the 
assumptions of Theorem 11.21 then the vector fields obtained by taking commutators up to 
order M + 1 also satisfy the assumptions of Theorem 11.21 and yield comparable balls. 
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In this case, one can prove develop a very satisfactory theory of the balls with 
these radii. In particular, one has (11.6[) . See Section [5.2.1l for more details. 

Despite the fact, as is mentioned in [TW03 , that the proofs from NSW85J 
generalize to show everything they needed, Tao and Wright put forth another 
proof method. That these methods can be rephrased, generalized, and combined 
with the methods of [NSW85 and classical methods to prove more general 
results is one of the main points of this paper. 

Remark 1.5. One point we have skipped over in this section is to compare the 
balls B$w (xq) to the balls -B(x,d) (xq,S) in the multi-parameter situation (they 
turn out to be comparable) . This is taken up in Section 15.2.41 

1.2.3 Nonsmooth Hormander vector fields 

The results in |NSW85j were stated only for C°° vector fields. However, it is clear 
from their work that there is a finite number M (depending on various quanti- 
ties) such that one need only consider C M vector fields. Unfortunately this M 
is quite large. This is due to the fact that the uses of the Campbell-Hausdorff 
formula in NSW85 require using very high order Taylor approximations of 
many of the functions involved. 

Much work has been done, in this single-parameter setting, to reduce the 
required regularity in the results of |NSW85j . Quite recently, and independently 
of this paper, two works have made great strides on this problem: the work of 
Barmanti, Brandolini, and Pcdroni [BBP08 and the work of Montanari and 
Morbidelli [MM08 . We refer the reader to these to works for the long list of 
works that proceeded them and for a description of applications for such results. 

To describe these results, suppose we are given vector fields Wi,...,W r 
satisfying Hormander's condition at step s > 1. That is, W\, ■ ■ ■ , W r along with 
their commutators up to order s span the tangent space at each point (see the 
discussion at the start of Section H.2.2[) . Then, BBP08 shows that one can 
recreate much of the theory of |NSW85j provided one assumes the vector fields 
are C s_1 . [MM08 achieves the same thing assuming the vector fields lie in a 
space that is between C 8 ~ 2,1 and C s_1,1 (see |MM08j for a precise statement). 

The regularity assumptions in this paper are incomparable to those discussed 
above0 As far as isotropic estimates go, the work of |BBP08| IMM08] requires 
less regularity than ours. However, our estimates are non-isotropic in nature. 
To understand this, use the vector fields W\, . . . , W r to generate a list of vector 
fields with single-parameter formal degrees {X\,di) , . . . , (X q , d q ) as in the start 
of Section ll.2.21 We then assume that each Xj is C 2 , and assume a non-isotropic 
estimate on the c\ ^ which is weaker than assuming the c\ ^ are in C 2 (here, the 
c\a are as in (jl.3|) ). Note that if one were to replace this with an isotropic 
estimate, we would require that Wj be in C s+2 , which is much worse than the 
results in [BBP08, MM08 . However, the point here is that we do not need to 

14 Of course, our results also apply to the multi-parameter situation, which is not true of 
IBBP08llMM08l . 
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take derivatives of Wj in every direction up to order s + 2, but instead we can 
mostly restrict our attention to derivatives that arise from taking commutators. 

It is likely that the regularity required in this paper is not minimal-even for 
the methods we use. Indeed, we often show that a subset of C 1 is precompact 
by showing that it is bounded in C 2 , leaving much room for improvement. 
Improving this would require an even more detailed study of the various ODEs 
that arise than is already undertaken in this paper, and this would take us quite 
afield of the main purpose of this paper (to understand the multi-parameter 
situation). Ideally, one would like to unify the non- isotropic estimates in this 
paper with the isotropic estimates of [BBP08 , MM08 , we do not attempt to do 
so here but we hope that the results in this paper will help to motivate future 
work in this direction. 

1.2.4 Some multi-parameter singular integrals 

In |Str08) . an algebra of singular integral operators was developed which con- 
tained both the left and right invariant Calderon-Zygmund operators on a strat- 
ified Lie group (see |Str08j for a precise statement). In this section, we outline 
the main technical estimate that was key to the work of Str08 . One of the 
main results of Section [5] is a generalizatiorf^l of this estimate, and we refer to 
the operators that come into play in this estimate as "unit operators." 

For the purposes of this section, we discuss only the case of the three di- 
mensional Heisenberg group H 1 , though all of the results discussed here hold 
more generally on stratified Lie groups. As a manifold H 1 is diffeomorphic to 
R 3 . For an introduction to H 1 , the reader may consult Chapter XII of |Ste93j . 
If we write (x, y, t) € M 3 for coordinates on H 1 , then the group law is given by: 

(a;, y, t) (x', y', t') = (x + x\y + y', t + t' + 2 {yx' - xy')) . 

With this group law, H 1 is a three dimensional, nilpotent Lie group. As such, it 
has a three dimensional Lie algebra. The left invariant vector fields are spanned 
by: 

X L = d x + 2yd t , Y L = d y -2xd t , T = d t . 
The right invariant vector fields are spanned by: 

X R = d x - 2yd t , Y R = y + 2xd u T = d t . 

Note that we have: 

[X L ,Y L ] = -4T, [X R ,Y R ]=4T, 

and T commutes with all of the vector fields. Moreover, the left invariant vector 
fields commute with the right invariant vector fields. 

Using these vector fields, we may create a two-parameter list of vector fields 
given by: 

(X L , (1, 0)) , (Y L , (1, 0)) , (T, (2, 0)) , (X R , (0, 1)) , (Y R , (0, 1)) , (T, (0, 2)) , 

15 See Corollary [6]8] for the statement of this generalization. 
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and we denote this list by (X, d) . It is easy to see that (X, d) satisfies the 
assumptions of Theorem 1 1.2 1 we therefore obtain a theory of the two-parameter 
Carnot-Caratheodory balls B( X ,d) ( x 0i 8). 

Let x be the characteristic function of the unit ball in M 3 , and for r £ (0, oo), 
define 

Xr (x, y, t) = r A x {rx, ry, r 2 t) . 
It is easy to see that Xr (£ -1 C) is supported for £ essentiall\P^I in B( v ^ (C (^,0)). 
Moreover, it is bounded by a constant times Vol (B(x,d) (C (7, 0))) 1 (and these 
results are sharp). For Xr (C£ -1 )> the same is true, but one must use the radius 
(0,i) instead of (±,0). 

Define a left invariant operator and a right invariant operator by: 

Op L (Xr) ■ / l-> / * Xr, OPfl (Xr) ■ f ^ Xr * /• 

A key ingredient of the theory in |Str08j was a study of the Schwartz kernel of 
the operator Op £ (xrj Op R (xr 2 ) (see Section 5.1 of |Str08j ). Let K ri _ r2 (CO 
denote this Schwartz kernel. The results in jStr08j show: 

• Kri,r 2 (CO is supported essentially in B (XA) U, ( i, 

• K ri , r2 (t,Z)<Vol(B {x>d) U,(±± 



• The above two results are sharp. In particular, there is an r] > such that 

we have 



K ri ,r 2 (C,0«Vol(B (X , d) (C 



1 1 

n ' r 2 

Using these results, one can study maximal operators. Indeed, define three 
maximal operators: 

Mf(0= sup —j- 1 / |/(0| 

-Ml/ (C) - sup —j- 1 / |/ (01 dC 

5>0 Vol LB( X)d ) (C (0, 0))J iB (x , d) (C,(i,0)) 

(C) = sup — — — 1 / 1/ (01 d£. 

5>o Vol (S(x,d) (C (0, d))J Js (Xj(J) (c,(o,5)) 
The results above show 

However, it is well-known that the one-parameter maximal functions Ml and 
Mr are bounded on L p (1 < p < 00); due to the fact that B^ x ,d) (C (',0)) and 



^By this we mean it is supported in a comparable ball. 
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B(x,d) (C) (0, •)) §i ye r i se to spaces of homogeneous type. It follows, then, that 
M is also bounded on LP (1 < p < oo). 

The goal of Section |H] is to see how far this proof (in its entirety) can be 
generalized. It was used heavily in |Str08j that the left invariant vector fields 
commuted with the right invariant vector fields. In Section[5]we will see that we 
do not need the relevant vector fields to commute, but can instead just assume 
that they "almost commute." This is made precise in Section [6l 

It is extremely likely maximal results hold for a larger class of our multi- 
parameter balls than what is shown in Section [Bl-but the study of unit operators 
seems very tied to the (rather strong) assumptions in Section^ We content our- 
selves, in this paper, with studying maximal operators under these hypotheses. 
It would be interesting to generalize these results further. 

1.2.5 The classical theorem of Frobenius 

In this section, we remind the reader of the statement of the theorem of Frobe- 
nius. We keep the exposition brief since we use the classical theorem of Frobe- 
nius only tangentially in this paper, and this section is more to fix terminology. 
Suppose M is a connected manifold, and Xi , . . . , X q are C°° vector fields on 
M. Suppose, further, that for each i,j there exist C°° functions c\a such that: 



Conditions like (jl.7[) are referred to as "integrability conditions." In this case, 
we have the classical theorem of Frobenius: 

Theorem 1.6. For each x £ M, there exists a unique, maximal, connected, 
injectively immersed submanifold LCM such that: 

• x £ L, 

• For each y £ L, T y L = span{Xi (y),...,X q (y)}. 
L is called a "leaf. " 

Remark 1.7. Often, one sees an additional assumption in Theorem [TjjJ Namely, 
that dimspan{Xi, . . . ,X q } is constant. This assumption is not necessary, and 
the usual proofs (for instance, the one in |Che46j ) give the stronger result in 
Theorem II .61 This was noted in |Her62j . 

Remark 1.8. Let I? be a C°° module of vector fields on an open set fi C R™. 
We call T> a (generalized) distribution. Suppose that T> satisfies two conditions: 

1. V is involutive. That is, if X, Y £ V, then [X, Y] £ V. 

2. T> is locally finitely generated as a C°° module. That is, for each x € 0, 
there is a neighborhood U containing x such that there exist a finite set 
of vector fields X\, . . . ,X q £ T> such that every Y £ T>, when restricted 
to U, can be written as a linear combination (with coefficients in C°°) of 
X t ,...,XgonU. 




(1.7) 



A- 
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Note, under the above hypotheses, X\, . . . , X q satisfy (|1.7[) on U (since Xj] £ 
T>). Thus, one may apply Theorem ll.6l to foliate into leaves, with each leaf L 
satisfying T y L = T> y , Vy £ L. Often, the Frobenius theorem is stated in terms 
of such an "involutive disribution which is locally finitely generated as a C°° 
module," instead of stated in terms of an explicit choice of generators as we 
have done in Theorem 11.61 The reason we have chosen to state the theorem 
with an explicit choice of generators is that we will need to discuss how various 
constants depend on the generators. 

Remark 1.9. Above we have stated the result assuming the vector fields are 
C°°. In fact, an analogous result (using, again, the usual proofs) holds only 
assuming that the vector fields are C 1 . In fact, there are even results when 
the vector fields are assumed to be merely Lipschitz (see Ram07j). However 
C 1 will be sufficient for our purposes (and most of the applications we have in 
mind require only C°°). 

We close this section with a discussion of the relationship between Theorem 
11.61 and Theorem 11.31 As we mentioned before, Theorem 11.31 implies Theorem 
11.61 To understand the philosophy behind Theorem II .3) let X be an index set, 
and suppose for each a £ I we are given C°° vector fields Xf , . . . , X^ on a 
fixed open set fi. Here, both q and fl are independent of a. Suppose further 
that for every a £ I we have, 

k 

Suppose, finally, that as a varies over X, X? and c^" vary over bounded (and 
therefore pre-compact) subsets of C°° . Since Theorem ll.6l applies for each a £ X, 
one might hope that it applies uniformhFI for a £ X. Indeed, this is the case, 
and is essentially the statement of Theorem 11.31 Hence, Theorem 11.31 may be 
informally restated as saying that the theorem of Frobenius holds "uniformly 
on compact sets" in the above sense. 

As it turns out, the classical proofs of Theorem 11.61 do not work uniformly 
in a in the above sense (this is discussed in Remark I3.4[) . If we fix xq £ SI 
and define tIq = dim span { X" (x ) , . . . , X^ (xo)}, then the classical proofs also 
depend on a lower bound for 

det X a (x ) , 

which may not be bounded below uniformly for a £ Zp^l 

There is another way to view Theorem 11.31 in relation to Theorem 11.61 Let 
X\,.. . ,X q be C°° vector fields satisfying f|1.7[) . Notice we have not assumed 

17 We mean uniformly in the sense that the coordinate charts which define the leaves can 
be chosen to satisfy good estimates which are uniform in a. 

18 It is not a coincidence that the failure of the classical proofs of the theorem of Frobenius 
to be uniform in an appropriate sense lies in the use of a lower bound of a determinant, just 
as in the work of Nagel, Stein, and Wainger (see Section ll.2.111 . Indeed, these two issues are 
closely related. 
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that n (x) = dimspan{Ai (x) , . . . ,X q (x)} is constant in x. The foliation asso- 
ciated to the involutive distribution generated by Xi , . . . , X q is called "singular" 
if no (a;) is not constant in x; and if no (x) is not constant near a point Xo then 
xq is called a singular point. 

In the classic proofs of Theorem ll.6l the coordinate charts defining the leaves 
degenerate as one approaches a singular point. Theorem 11.31 avoids this. This 
is an essential point in Section [6.21 

2 Basic definitions 

Fix, for the rest of the paper, a connected open set O C R n . Suppose we are 
given a list of C 1 vector fields X\, . . . , X q defined on fi, and let X denote this 
list. As mentioned in Section fTTTl we will often identify this list with the n x q 
matrix whose columns are given by the vector fields X\ , . . . , X q . In addition, 
we will define (when it makes sense) X a , where a is an ordered multi-index, in 
the usual wavl 19 ! Thus, X a is an |a|th order partial differential operator. In the 
introduction, we defined the Carnot-Caratheodory ball of unit radius centered 
at xo £ f2. We denoted this ball by Bx {xq). 

It will often be convenient to assume that Bx (xo) lies "inside" of il. More 
precisely, we make the following definition: 

Definition 2.1. Given xo £ ft, we say X satisfies C (xo) if for every a — 
(oi,...,o,)e (i°°([0,l])) 9 , with: 

lll a lll,L°=([oa]) = 

there exists a solution 7 : [0, 1] — >• fl to the ODE: 

1 '(t) = J2a j (t)X j ( 7 (t)), 7 (0) --T . 
3=1 

Note, by Gronwall's inequality, when this solution exists, it is unique. 

As in the introduction, to define Carnot-Caratheodory balls of (possibly 
multi-parameter) radii, we assign to each vector field Xj a formal degree ^ 
dj £ [0,oo) . Here v £ N is a fixed number, independent of j, representing 
the number of parameters. We denote the list (Xi, di) , . . . , (X q , d q ) by (X, d). 
In the introduction, we defined (for 5 £ [0, 00)") the list S d X to be the list of 
vector fields 5 dl Xi, . . . , S dq X q . Then, we defined the multi-parameter Carnot- 
Caratheodory ball B(x,d) {xo,S) := Bsx (xo)- Just as in Definition 12.11 it will 
often be useful to assume B(x,d) (xq,S) lies "inside" of Q, and so we make the 
following definition: 

19 For instance, if a were the list (1, 2, 1, 3), then X a = X 1 X 2 X 1 X S and |a| = 4, the length 
of the list. 
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Definition 2.2. Given xo £ £1 and S £ [0, oo)", we say (X, d) satisfies C (xq, S) 
if S d X satisfies C (xo). 

In addition to the balls B(x,d) (^o, 5) it will be useful to define some smaller 
balls. Given xq G f2 and 6 G [0, oo) 1 ', we define 

B(x,d) (xo, 5) = {y e fl : 3a € R 9 , |a| < 1, y = cxp (a • S d X) x } . 

Note that B(x.d) (x ,S) C B^ x ,d) {xq,5). 

Given a list of vector fields along with formal degrees (X, d) and J = 
(jij ■ • ■ ,jn ) € X(n ,q), we defined in the introduction the list of vector fields 
with formal degrees (X, d)j and the list of vector fields Xj. Namely, (X, d)j is 
the list (Xj 1 , dj 1 ) , . . . , \Xj , dj nQ ) and Xj is the list Xj x , . . . , Xj nQ , while dj is 
the list dj t , . . . , <ij no . 

Note that if (X, d) satisfies C(xq,6), then so does (X, d) j. In addition, we 
have, 

B (x,d) , {xq, S) C B(x,<q (^o, <5) , B{X4),, (^o, 6) C B(x,d) (^o, <5) • 

Often, it will be convenient for our estimates to state the definition of 
B(x,d) { x o, S) in a slightly different way. Thus, given the formal degrees di,...,d q 
and given a a = (oi, . . . , a g ) G M. q , S £ [0, oo)", we define: 

5 d a= {5 dl a u ...,8 d ia q ) , 
S- d a = (S- dl a l7 ...,6- d "a q ) . 

Then we have: 

B (x ,d) (*o, - {2/ G n :3 7 : [0, 1] -> n, 7 (0) = x , 7 (1) = y 

y (<) = a(i)-x( 7 (i)),|||rt||| ioc([01]) <i}. 
3 The (uniform) theorem of Frobenius 

In this section, we present a uniform version of the theorem of Frobenius: the 
special case of Theorem 11.31 when the vector fields are assumed to be linearly 
independent. The work in this section was heavily influenced by the methods in 
Section 4 of |TW03j and those in [NSW85j. In fact, a result similar to a special 
case of Theorem 13.11 is contained in }TW03| , though the result there is stated 
somewhat differently (see Section 15.2.11 for a discussion of their results) . Our 
goal, in this section, is to rephrase and generalize the proof methods from these 
two papers to suit our needs. 

In our context, we are faced with a few difficulties not addressed in |T W03j . 
A main difficulty we face is that we will not assume an a priori smoothness 
that was assumed in that paper. This will require us to provide a more de- 
tailed study of an ODE that arises in that paper. This difference in difficulty 
here, is that while in that paper existence for a certain ODE was proved via 
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the contraction mapping principle, we must also prove smooth dependence on 
parameters. Furthermore, we will generalize their results to vector fields that 
do not necessarily span the tangent space. While this may seem like an artificial 
generalization, it will prove to be essential to our study of maximal functions 
and unit operators in Sections 14.21 and [51 Finally, we must also combine these 
methods with the methods in |NSW85] to prove the relationships between the 
various balls we will define. 

Let X = {X\ , . . . , X no ) be no C 1 vector fields with single-parameter formal 
degrees d = {d\, . . . , d no ) € (0, oo)™° defined on the fixed connected open set 
£1 C E n . Fix 1 > £ > 0, xq e ft, and suppose that (X,d) satisfies C(xq,£). 
Suppose further that the XjS satisfy an integrability condition on Bix,d) ( x 0iO 
given by: 

[Xj,Xk\ = yv i;fc Xi. 
i 

In this section, we will assume that: 

• X\ (xq) , . . . , X no (xq) are linearly independent. 

• ll x illci(fl (JCid) (x a> 0) < °°' for ever y 1 - 3< n o- 
. For |a| < 2, X a c\ k G C° {B {x>d) (x ,S)), and 

12 W Xacl 3*\\c«(B (x , d) ( X0 ,Z)) < °°> 
|a|<2 

for all j, k, I. 

We will say that C is an admissible constant if C can be chosen to depend 
only on a fixed upper bound, d max < oo, for d\, . . . ,d no , a fixed lower bound 
> for d u ..., d na , a fixed upper bound for n (and therefore for no), a 
fixed lower bound, £o > 0, for £, and a fixed upper bound for the quantities: 

Furthermore, if we say that C is an m-admissible constant, we mean that in 
addition to the above, we assume that: 

J2 ll xa 4,fcllco(fl (x , ti) (xo,e)) < °°' 

\a\<m 

for every j, k, I (in particular, these derivatives up to order m exist and are con- 
tinuous). C is allowed to depend on m, all the quantities an admissible constant 
is allowed to depend on, and a fixed upper bound for the above quantity. Note 
that <o, <i, <2j and < all denote the same thing. 

For r\ > 0, a sufficiently small admissible constant, define the map: 

$ : B no (rj) -> B [X4) (x ,£) 
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by 

<& (u) = exp (u ■ X) xq. 

Note that, by Theorem IA.11 $ is C 1 . The main theorem of this section is the 
following: 

Theorem 3.1. There exist admissible constants r\\ > 0, £1 > 0, such that: 

• $ : B no (jft) -> B(x,d) 0&o,£) * s one-to-one. 

• for aZZ it G B no (ni), |det„ oX „ d$ (tt)| w |det„ oX „ X (x )|- 

• B(x,d) (a ,£i) C $(B no (771)) C B (Xid) (x 0) £) C B (JM) (x 0) £)- 
Furthermore, if we let Yj be the pullback of Xj under the map 3>, then we have: 

W Y ^(B noim )) 1 (3.1) 

in particular, 

Finally, if for u G _B„ (771) we define the no x no matrix A (u) bvffil 

(Y 1 ,...,Y no ) = (I + A) Vu 

then, 

sup ||A(«)||<-. 

«£fl» (i)i) z 

This section will be devoted to the proof of Theorem 13.11 

Remark 3.2. In [TW03 , the map $ was defined with a large parameter K. 
Then, a result like (|3.1[) was proven by taking X large depending on m. It 
is important for the applications we have in mind that this procedure is not 
necessary. In our setup, this procedure is similar to taking the parameter k in 
Theorem 13.101 small depending on m; however we will see that we will be able 
to fix K = ^ throughout. 

Remark 3.3. The formal degrees, d\, . . . , d no do not play an essential role in this 
section. Indeed note that they do not play a role in the assumptions for Theorem 
13.11 Moreover, since ss 1, they do not play a role in the conclusion either. 
Indeed, Theorem 13 . 1 1 with any choice of d\, . . . , d no € (0, 00) is equivalent to the 
theorem with any other choice (though the various constants in the conclusion 
of Theorem [3T] will depend on the choice of the ds). The reason we have chosen 
to state Theorem 13.11 with an arbitrary choice of ds (instead of taking, say, 
d\ = ■ ■ ■ = d na = 1) is that when we prove Theorem 15.31 we will be, in effect, 
applying Theorem 13.11 infinitely many times. Having stated Theorem 13.11 for 
general d will allow us to seamlessly apply the results here without any hand- 
waving about how various constants depend on the formal degrees. 

20 Here we are thinking of v« as the vector ( d ui , . . . , 9u„ J • 
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Remark 3.4. As was discussed in Section 11.2.11 the methods in [NSW85j fail 
to prove Theorem 13.11 It is also worth noting that the methods usually used 
to prove the theorem of Frobenius are insufficient to prove Theorem 13.11 For 
simplicity, we discuss the proof in |Lun92] . but similar remarks hold for all 
previous proofs we know of. In |Lun92) , an invertible linear transformation was 
applied to X\ , . . . , X no (call the resulting vector fields Vi , . . . , V no ) . This was 
done in such a way that [Vi, Vj] =0 for every i,j. Because of this, the map: 



is easy to study. Unfortunately, we know of no a priori way to create such an 
invertible linear transformation without destroying the admissible constants. A 
fortiori, however, we may just push forward the linear transformation (I + A) 1 
via the map <E> to obtain such a linear transformation. This idea seems to yield 
no nontrivial new information. 

Remark 3.5. Morally, Theorem 13.11 f along with Theorems 14.11 and 15. 3[) is a 
compactness result. This is discussed at the end of Section [1.2.51 The use of 
this compactness can be seen every time we apply Theorem IA.3I Moreover, 
this compactness perspective was taken up in Section 4 of |Str08] . In fact, one 
of the main consequences of this paper is that one may remove condition 4 of 
Definition 4.4 of |Str08] . and still obtain the relevant results (this is tantamount 
to saying that we do not require a lower bound for a determinant as discussed 
in Section [1.2. Thus, from the remarks in that paper, one can easily see the 
results in this paper from the perspective of compactness. 

The next two lemmas we state in slightly greater generality than we need, 
since we will refer to the proofs later in the paper. 

Lemma 3.6. Fix 1 < n\ < uq. Then, for 1 < j < no, I g X(m,n), J G 
T(ni,n ), x e B [x ,d) (xo,0> 



Proof. We use the notation Cjj to denote the Lie derivative with respect to the 
vector field U , and iy to denote the interior product with the vector field V. 
Cu and iy have the following, well-known, properties: 

• Cuf = U f for functions /. 



u i ^ e 



,u-V 



X 3 det X (x) 



i, j 



< det X(x) . 



ni xni 



[Cu, iv] = i[u,v]- 



Cjjoj = ijjduj + dijjio, for forms w. 
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Fix / = (ii, . . . , i ni ) , J — (ji, . . . , j ni ) as in the statement of the lemma. 
Then, 

det X (x)j j = ix jni ix jn _ x ■ • ■ ix j± dx ix A dx i2 A ■ • • A dx ini . 
Thus, we see: 

Xj det X (x)j j = Cxjix jni i x Jni _ 1 • • • ix^dx^ A dx i2 A • • • A dx ini 

= \*i,Xi n ' ' ' ix Ji dx ^ A dxi z A ' ' ' A dxi ni 

+ ix jni i x , y ix n dx h A dx i2 A • • • A dx ini 

H h ix jni ix jni _ t ■ ■ ■ Hx^XjA^ii A ^2 A • • • A dx^ 

+ ix jni ix jn _j ■ • • «x 31 £x 3 (rf^-t! A <fx i2 A ■ • • A dx ini ) . 

(3.2) 

Every term, except the last term, on the RHS of (|3.2p is easy to estimate. We 
do the first term as an example, and all of the others work in the same way: 



i [x J ,x jni ] i x ]ni _ 1 ■ ■ ■ ix n dx. h A dx i2 A • • • A dx in± 

^ Cj <jni ixjx ini _ 1 ■ ■ ■ ix n dx tl A dx l2 A ■ • • A dx ini 



< 



fe=i 
det X (x) 

n\ X 7l\ 



Since, for each k, ix k ixj x • • ■ ijjjj dx^ A dxi 2 A • • • A d£j ni is either or of the 
form ± det X {x) I Jt for some J' £ I (rii, no). 

We now turn to the last term on the RHS of (13.21). We have: 



Cxj (dx il A dx i2 A • • • A dx in% ) 

= [Cxjdxi x j A dxi 2 A • • • A dxi + dxi x A {Lxjdxi^) A • • • A dxi rei 
+ • • ■ + dxi 1 A dxi 2 A • ■ ■ A (^Xj dxi ) . 

So we may separate the last term on the RHS of (I3.2j) into a sum of ni terms. 
We bound just the first, the bounds of the others being similar. To to this, let 
^•=E^^- Note that 116^1^^^ <1. 



ix jni iXj ^ ■ ■ ■ ix n (CxjdxiJ A dx i2 A • • • A dx ini 

> -rr—ix ix ■ • • ix., dxi A dxi A • • • A dxt 



< 



det X(x) 

m xni 
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since each of the terms ix ix ■ • ■ ix dxi A dxu A • • • A dx; is either or 
of the form ± det X (x) I , j for some I' £ I (rii, n). □ 

Remark 3.7. The reader wishing to avoid the use of Lie derivatives in Lemma 
13.61 should consult Lemma 2.6 of |NSW85j where a similar result in the special 
case ni = n is shown directly, without the use of Lie derivatives. However, the 
proof we give in Lemma 13.61 is easily adapted to other situations that will arise 
in this paper (e.g. Lemmas 14. 101 and 14. 13p . while the proof in INSW85] becomes 
progressively more complicated to generalize. 

Lemma 3.8. For y G B^ X: d) ( x o>£)> 1 — n i ^ n 0; we have 



det X(y) 

111 xni 



det X (xq) 

n i X n i 



In particular, since |det„ oXno X (x )\ ^ 0, |det„ oX „ X (y)\ ^ 0. 

Proof. Since y £ B (X4) (x ,£), there exists 7 : [0, 1] B (x ,d) (xq,Q with 

• 7(0) = Xq, 7(1) = y, 

. Y(t) = a(t)-X( 1 (t)), 
. aG (L~([0,l])r, 



• da IIL~([ ,i]) 

But, then consider: 



< 1. 



d_ 

dt 



det X(-y{t)) 

ni xni 



= 2 detX / , J ( 7 (i))-det^ J ( 7 (t)) 

lEX(ni ,n) 
j£X(m ,no) 

= 2 J] detX/,j(7(«))((o-Jr)detX/,j)(7(t)) 



< 



/£X(ni ,n) 
j£X{ni ,n ) 



det X( 7 (t)) 

ni xni 



where, in the last step, we have applied Lemma 
ity shows: 



Hence, Gronwall's inequal- 



det X(y) 



det X(7(l)) 



< 



det X(j(0)) 



det X (xq) 



Reversing the path 7 and applying the same argument, we see that: 



det X (xq) 


< 


det X(y) 









completing the proof. 



□ 
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Now consider the map <£> : B no (77) — » B( X ,d) (xo,Q- d$ (0) = X (x ), and 
it follows that det noX „ d<£> (0) ^ 0. Hence, if we consider $ as a map to the 
leaf generated by X passing through the point xo , the inverse function theorem 
shows that there is a (non-admissible) S > such that: 

$ : B no (6) -> <E> (B no (S)) 

is a C 1 diffeomorphism. Pullback the vector field Xj via the map <!> to B no (5). 
Call this C° vector field Y r 
Clearly % (0) = J-,. Write: 



y ■ — ^ + a fc ^ 

J duj Z-j J du k 



(3.3) 



with (0) = 0. Moreover, in polar coordinates, for to fixed, Remark IA.2I 
shows that a*? (rcj) is C 1 in the r variable, and it follows that for lu fixed, 
a'j (rto) — O (r). We will now show that ah satisfies an ODE in the r variable. 



The derivation of this ODE is classical (see, for instance, page I 55 of |Che46j . 
though we follow the presentation of [TW03 , ) . and is the main starting point 
for this entire section. We include the derivation here, since it is not very long, 
and is of fundamental importance to the rest of the paper. 
Continuing in polar coordinates, 



Hence, 



$ (r, lu) = exp (r (lu ■ X)) Xq. 



d<f> (rd r ) ($ (r, lu)) = rd<f> (d r ) ($ (r, u)) = rw ■ X ($ (r, lu)) . 



Writing this in Cartesian coordinates, we have the following vector field identity 
on B no (5): 



no 



3=1 3 3=1 

Taking the lie bracket of (|3.4|) with y, we obtain: 

n no 

\% + u 3 [Yud Uj ]) = Yj 



n 



3=1 



3=1 
n 



Y Y 



i=i 



(3.4) 



(3.5) 



where ■ (u) — c^j (<& (u)), and we have used the fact that Yi,Yj =J2 ^j-^fc- 

Remark 3.9. Since y is not C , one might worry about our manipulations in 
(|3.5p . This turns out to not be a problem. Indeed, it makes sense to take the 
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above commutator, since Yi is C 1 in the r variable (and we are commuting it 
with rd r ). Then, the computations on the LHS of (I3.5P may be done in the sense 
of distributions, while the computations on the RHS may be done by pushing 
everything forward via the map We leave the details to the reader. 

We re- write (|3.5[) as: 



E Uj \d U] ,Yi~d Ui J + % - d Ui 



(3.6) 



= - ( E (?* - ( 9 i - - E E ^ (") 

ii=i / 3=1 '=i 



Plugging (|3.3p into (|3.6I) we have: 

E E u J & i) d ^ + E a X 

j=l k=l k=l 



n I no \ n no / no 



= - EE^X E E'vC K - EE E^L 

\3=1 fc=l / k=l \j=l J 1=1 fc=l \j = l / 

(3.7) 

Taking the (9 Ufc component, and writing Y^—y Ujd Uj + 1 = d r r, we have from 
(I3T71) : 

n n n / n \ 

= - £ - E u Ai -EE *A a?- (3-8) 

3=1 3=1 '=1 \J=1 / 

Define two no x hq matrices, A, C u by: 

Ai, k := (a, fc ) , (C„) iifc := ( E«i^j J , 1 < i, k < n . 



Using this, (|3.8I) may be re-written as the matrix valued ODE: 

d r rA = -A 2 - C U A - C u . (3.9) 

Theorem 3.10. Fix i > k > (throughout the paper we will choose K = j)- 
Consider the differential equation: 

d r rA (rw) = -A (rw) 2 - C u (ra) A (rw) - C u (ru>) , (3.10) 

defined for A : B no (n) — > M, loX „ (R), where M„ oXno (R) denotes the set of 
no x no real matrices. Then, there exists an admissible constant r\\ = rji (k) > 
smc/i that there exists a unique solution A £ C {B no (771) ;M„ f)Xno (R)) to fUgP 
satisfying A (rw) = O (r) /or eac/i /ixerf w. Moreover, this solution satisfies: 
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• P(*)ll<l*l- 

• su PteB no ( Vl ) \\ A (t)\\ < «■ 

Furthermore, if G C m (B no (rji)) with \\^j\\ Crn f B ^ j\ < °°; G 
C™ 1 (5n (^) i M„ x „ (K)), and «/ C m<Tll is a fixed upper bound for: 

ll^llc»(^(»})' l<*,J,*<no, 
i/ien, £/iere exists an admissible constant C m — C m (m,C m ^ n ^j such that: 

ll^llcf'»(B nD (r )1 );M no xn D (R)) - Cfn ' 

Note that (I3.10P is not a standard ODE (due to the factor of r on the 
left hand side), and so we cannot apply the standard theorems for existence 
and dependence on parameters. Fortunately, though, we will be able to prove 
Theorem [3TTU1 by adapting the methods of |Izz99j . In |TW03j . the solution A 
was assumed to be a priori C°°, thereby removing many of the difficulties in 
the proof of Theorem 13. 101 Before we begin the proof, we need two preliminary 
lemmas: 

Lemma 3.11. Fix e > 0. Suppose g € C m (B no (e)). Define h on B no (e) by: 

I - fn Q (sui) ds ifr^O, , 
Hru)= r Jo - ' J T ' (3.12) 
[9(0) ifr = 0. 

Then, h £ C m {B no (e)). Moreover, if a is a multi-index with \a\ < m, we have: 

w , C^ttt fn s |q| (d?g) (su)ds ifr^O, 
{dZh) (ru) = ^ J ° \ » y > y ' 1 3.13 
{]^+i(dug)(v) i/r = 0. 

Proof. Note that, since g € C m , the right hand sides of (|3.12p and ()3. 13|) arc 
both continuous in r. Note, also, that to prove the lemma, it suffices to prove 
the formula (|3.13p for g g C°°, as then the linear map g M- h will extend as a 
map C°° — > C m to a map C m C m . Hence, we prove the lemma just under 
the assumption g £ C°° (this reduction is not necessary for our proof, but it 
simplifies notation a bit). 

First, we prove the lemma for r ^ 0. Away from r = 0, h is clearly C°°, and 
so we need only verify the formula (|3.13p . h satisfies the formula: 

d r rh (tuj) = g (ru) . 

Apply d" to both sides of this formula. Using the fact that [d", d r r] = \a\ d", 
we have: 

d r r (d a a h) (ru) + \a\ (%h) (ru) = (d°g) (ru) . 
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Multiplying both sides by r' a \ we obtain: 

3 r rH +1 ffih) (ru) = rl«l (d^g) (rw) 

and (ETT51) follows for r ^ 0. 

Hence, to complete the proof, we need only show that d"h exists at and 
is given by the \ a \ +1 (<9"g) (0). We first consider the case when: 

9^(0) =0, 0<\[3\<m, 

and we prove the result by induction on the order of a, our base case being the 
trivial case \a\ — 0. Thus, suppose we have the result for some a, \a\ < m and 
we wish to show that the following derivative exists, and equals 0: 



d Uj dZh{ru) 



= d Uj 

r=0 



TR+t/o sH(0£ 5 )Md S ifr^O 
if r = 



r=0 



And this will follow if we can show that 
1 



•M+i 



^(d^g)(su;)d S = o(r), (3.14) 



But, by our assumption on g, (d"g) (slu) = O (s 2 ) and (|3.14[) follows, completing 
the proof in this case. 

Now turn to the general case g £ C°°. We may write: 

9( u )= E i(*)( fl )^+*w 

\0\<m 

where g e vanishes to order m at 0. Thus, by linearity of the map g t— > h, it 
suffices to prove the lemma for monomials . Since we know (|3.13l) holds away 
from r = and we know the RHS of (13. 13)) is continuous, it suffices to show 
that if g = vP y then h £ C°°. But in this case, h = i p} +1 u@ £ C°°, completing 
the proof. □ 

Lemma 3.12 ( [Izz99] , p. 2060). Suppose (M,p) is a metric space, and suppose 
(Qn) n=0 is a sequence of contractions on M for which there exists a number 
c < 1 such that: 

p(Qn (x),Q n (y)) < cp(x,y) 

for all x,y £ M and all n. Suppose also that there is a point Xoo £ M such 
that Q n (Xoo) Xoo as n oo. Let xq £ M be arbitrary, and define a sequence 
(x n ) by setting: 

Qn (-^n) • 

Then, x n — > Xoo as n —> 00. 
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Proof of Theorem \3.10\ It is easy to see from the definition C u that: 

HCuMII < Dr 

where D is an admissible constant. Take 771 = r/i (k) > to be an admissible 
constant so small that: 



K 2 + _Jh f K _|_ 1) < K K+ ±^l<£. 
t 2 y -r j - , -r g _ 4 

Our first step will be to show the existence of A using the contraction map- 
ping principle. Moreover, this contraction mapping principle may be considered 
the base case in an induction we will use at the end of the proof, to establish 
the regularity of A. Consider the metric space: 



Dm . 3 



M:={AeC (B no ( Vl ) ; (K)) :A (0) = 0, sup 

0<r<?;i 



-A(ru) 



< 00, 



with the metric: 



p (A, B) = sup 

0<r<))i 



SUp || A (t) || < K 

teB„ ( Vl ) 



{A (rcu) — B (ruj)) 



Note that M is complete with respect to the metric p. Define the map T : M 
(K)), by: 



TAlruj) = S p 



1 J r -A {sujf - C u (aw) A {soj) - C u (sw) ds if r ^ 0, 

if r = 0. 



Note that, by Lemma [3TQ TAeC (B m (771) ; M„„ xn „ (M)). 

Our first goal is to show that T : M — > M. Consider, for < r < r)i, 
lu € S""- 1 , A G M, 

\\TA(rcj)\\ < - r \\A( S oj)\\ 2 + \\C u (soj)\\ \\A(slj)\\ + \\C u (sLj)\\ds 



r 



< 



r 

< K 



1 r 

- (k 2 + Dsn + Ds) ds 
r Jo 



2 , D Vl , D Vi 



< K. 



Thus, by the definition of TA, sup tEBn {r]i) \\TA (t)\\ < n. 
Next, we have: 



-TA (rui) 



< 



(sup (0, A) + Dsn + Ds) ds 



K , , s Dk D 
= -p(0,A) + — + - <oo. 
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Hence, T : M -> M. 

Next, we wish to show that T is a contraction. Consider, suppressing the 
dependence on slo in the integrals, 



{TA (ruj) - TB [ruj)) 













i r 


< - 




r Jo 



< Kp (A,B) + ^p(A,B) 



< - A P(A,B) 



where the last line follows by our choice of r\\. Thus, we have p (TA,TB) < 

Ip(A,b). 

Applying the contraction mapping principle, there exists a unique fixed point 
A € M such that TA = A. This is the desired solution to (f3TT0|) . Since A e M, 
we have sup teBn \\A {t)\\ < n. Moreover, since A = lim n _ ) . 00 T n (0), we have: 



n=0 



p (0, A) = Jim p (0, T n 0) <J2p (T n -\ T n 0) <J2[j) P (0, ^0) = 4p (0, TO) 

" °° n=l 

and for r ^ 0, we have: 



1 



D 



< — / Dsds < — 



-TO (ru) 
r 



and so /o(0,T0) < 1 and therefore p(0, A) < 1. This can be rephrased as 
\\A(t)\\<\t\. 

We now turn to uniqueness of the solution A. Suppose B is another so- 
lution (we are not, necessarily, assuming B £ M). Suppose that, for ui fixed, 
\\B (rw)|| = O (r). Then, we have: 







+ 




+ 






) 




s 




s 




s 







And applying the integral form of Gronwall's inequality to ||r (A — B)\\ shows 
that A — B. 

To conclude the proof, we need to show that if c%j £ C m , then A 6 C m , and 
to estimate the C rn norm of A. First, we show that A 6 C rn . To do this, we will 
show that T"0 -> A in C m (B„ (771) ;M„ oX „ (R)) (here we mean the Banach 
space of those C m functions all of whose derivatives up to order m are bounded 
on B no (771)). We proceed by induction on m, our base case being m = 0, which 
we have already proven, by the contraction mapping principle. Thus, suppose 
ll^,jllc m (s < 00 f° r 1 — *>J)fc < no and suppose 



lim ||T n 0-A|| 



C"»-i(B no (77i);S 



r» (K)) 



0. 
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Fix | a | = 77i. We will show that 

figT"0 

converges in C° (B no (771) ;M„ oXno (R)), and this will complete the induction. 
Note that, by Lemma [3.111 we know that, for each n, T n £ C m . Fix r ^ 0, 

Define j n — T n (0), 700 = A. By Lemma [3. Ill we have, for n < 00, 



dZT ( 7 „) (rw) = — / (-7* - C uln - C u ) ds 

r Jo 

= E -4+T f Sm (-^ln){dTln)~{d^C u ){dTln))ds 



n* 1 1 

Ct\ -\-Ct2 — Ot 



1 



(3.15) 



Define, for I £ C° (B„ (7/1) ;M„ oX „ (R)), and for < n < 00, 



Q n (I) (rw) = - £ ^TT [* m (^7n) (3£ a 7») ^ 

ai+a2 — a 

, ' Jo 



ai+Q2— ck 
1 



(3.16) 



1 



s m d%C u ds 

s m (Ijn + iJ + CJ) ds. 



Note that Q n (I) (u) extends continuously to u = and we have: 
Q n : C° (B no (r)i) ; (K))->C° (£>„ (r?! ); ™noXno 

Putting (|3.15[) and (13.161) together, we see, for < n < 00, 

Q„(a?7n) = asr( 7n ). (3.17) 

Our next goal is to show that Q n is a contraction (77 < 00), as a map 
Q n : C° (B m (771) ; (R) ) -> C° (S no (771 ) ; M„ x „ (R) ) . 
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Consider, for r ^ 0, cj e 5"° 1 , and using that 7„ £ M for all n, 
\\Qn(h)(rw)-Q n (l2)(ru)\\ 

S m [(/l - I 2 ) 7 n + In (h - h) + Cu (h - l 2 ) 



1 



< IKi-'2|| C o (B „ o( , l);M „ ox „ (R))^rr / * m (2« + £>«) 

- II' 1 ~ l2 "C°(B no ( m );M noXno (n)) + J 

3 

< J ll'l - Mlc«(B„ ( m );M„ 0X „ (R)) 

where the last line follows by our choice of r\\. 

Next, fix Z G C° (B„ (r/i) ;M„ oX „ (R)). We wish to show that Q n (Z) 
Qoo (I) in C° (B no (m) ; 

M„ oX „ (R)). Consider, 



S m [d^ (ln~Joo))(dTln)d.S 



s m (d^C u )(d™ (7«-7oo))d« 



Q„ (0 (rw) - Qoo (I) (rw) = - E / 



Ql+Q2— Q: 



E 



1 



m+1 



ai+CK2— a 
Q 2 #0 



r 



E 



1 



1 



r 



m+l 



r m+l 



s m (Z (7„ - 7oc) + (7„ - 7oo) 2 + C,/) ds. 



Using our inductive hypothesis that j n — ¥ 7^, in C m (B no (rji) ;M„ oX „ 
it is easy to show that the above goes to uniformly in (r, uj) as n — > 00. 

In particular, if we let be the unique fixed point of the strict contraction 
Qoo we have that Q n (l^) -> in C° (B no (771) ;M„ oX „ (R)). Using (|3.17|l . we 
have: 

^7n+i = ^T( 7n ) = Q„(9«7 n ). 
Hence, Lemma T3 . 1 21 shows that: 

which shows that <9£T"0 converges in C° (S no (771) ; M noXno (R)). It follows that 
AeC m (^fe);^ (R)). 

Moreover, we have that <9"A = Z^, where Z^ was the unique fixed point of 
Qoo- Hence, by the contraction mapping principle, d™ A = lmin^oo Q^O. It 
follows, by a proof similar to the one we did before for T, that we have: 

H a «^llc°(B» (j7i);M„ oX „ (M)) ^ 4 WQ™ ( )llo(B„ ( J71 );M„ x„ (E)) ■ 
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Let us suppose, for induction that we have p.lip for to — 1. Then to prove 
(|3.11l) for to it suffices to show that: 

IIQoo (0)llc°(S„ o ( ? 7i);M„ oX „ o (R)) - Cm (m,Cm,Vi) 

but this follows immediately from the inductive hypothesis and the definition 
ofQoo- □ 

Now fix rjx and A as in the conclusion of Theorem 13. 101 taking k = \ . 



Lemma 3.13. ^4 _ ,,. = A. 

Proof. Using, as remarked before, that for fixed u, 
just as in the proof of uniqueness in Theorem 13. fO 



A (ru) = O (r), this follows 

□ 



Lemma l3.f 31 shows that we may extend the vector fields Yj by setting: 

"0 

fc=i 

where Aj t k = ( a j)- 

Theorem 3.14. d$ (Yj) = Xj. 

To prove Theorem 13. f 41 we need a preliminary lemma: 

Lemma 3.15. Fix u £ S*™ 0-1 , r$ < r\\, and suppose that for all r < ro, 

|det„ oXrao d$ (ru)\ ^ 0. Then, on the line {ruj : < r < ro}, we have d<& (Yj) = 
Xj, 1 < j < n . 

Proof. Suppose not. Define 

ri = sup {r > : d<£> (Yj) = Xj on the line {r'u : < r' < r} , 1 < j < n } . 

Then we must have r\ < r$ (by continuity). Since Yj = Yj\ B we know 

that n > 0. Since |det no xn (riui) \ ^ 0, the inverse function theorem implies 
that there exists a neighborhood V of r\u such that $ : V — > <£> (V) is a C 1 
diffeomorphism. 

Pick < r2 < ^3 < r\ < r4 such that: 

{r'u : r2 < r < r^} C V. 

Let Y^- be the pullback of Xj to V via the map $. By our choice of 7*1, we have 
that on the line {r'cj : r2 < r' < r$}, Yj = Yj. On the other hand, if we write: 

Yj = d Uj + a k jd Uh 

then the coefficients dj satisfy the differential equation (|3.f 0[) (this follows just 
as before). Away from r = this is a standard ODE, so standard unique- 
ness theorems (say using Gronwall's inequality) show that Yj = Yj on the line 
{r'u : r2 < r' < r^}. This contradicts our choice of r%. □ 
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From here, Theorem l3.14l will follow immediately from the following theorem: 
Theorem 3.16. For all t e B no (771), 



det d®(t) 




det d$(0) 




det X (xo) 


noX»o 




n xn 




n xn 



Theorem 13. 161 in turn, follows immediately from the following lemma and a 
simple continuity argument: 

Lemma 3.17. Fix u 6 < tq < r\\. Suppose for < r < r$, 

|det„ xn d& {roS)\ 7^ 0. Then, for all < r < ro, 



det d$ (ru) 

no xri 



det d$(0) 

"oXIlo 



Here, the implicit constants depend on neither ro nor u). 
Proof. By Lemma |3.15[ for all < r < ro, we have 

d$ (^•)(*(rw))=X i ($(rw)). 
Rewriting this in matrix notation, we have: 

d<P ((/ + A) V u) (* (ru)) = X (* (rw)) 
Thus, by applying (|B.1[) . we have at the point $ (rw): 



det d$(I + A) 

noxno 



det X 




"o xn 





det ((J + A) (J + A) 



|det (I + A)\ y/det (d¥W) 
|det {I + A)\ 



det di> 

noxno 



However, we have ||^4|| < |, and so we have: 



det d$> (ruj) 

n xn 



Applying Lemma ELS we see that 



det X($(ru))) 

n xn 



det X($(ru>)) 

n xn 





det X (so) 




det d$(0) 




n xn 




n xn 



completing the proof. 



□ 
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Proposition 3.18. We have, for m > 0: 



\a\<m+l 



ll A llc-(S r » (r 3l );MI„ oX „ (K)) ~"> L 

7t immediately follows that: 

\\ Y iWc^(B no ( Vl )) ~™ 1 



in particular, 



Proof. We prove the result by induction. Our base case will be m = 0. We 
already know, 

Pllco(B„ oM <^0l. 

Recall, <o, <i, <2, and < all mean the same thing. For notational convenience, 
write the operator: 

Vy/= (Y 1 f,...,Y na f). 
Since yy = (/ + -4) Vu, and \\I + ^llco(B no ( m );M no><no (M)) ~ ^ [t follows that: 

J2 W Ya fWc°(B no ( m )) ~0 ll/llci(B„ ( m )) • 
I«|<1 

Conversely, since v« = U + ^4) 1 Vr and (7 + A) -1 < 

C°(B„ („i);M„ oX „ (R)) 

1 (which can be seen by writing (I + A)^ 1 as a Neumann series), we have: 
\\f\\ci(B no ( Vl )) J2 ll y "/llc"(s„ ( m )) • 
Suppose, for induction, that we have: 

ll/llc»(B B0 (^)) -rn-l H ya /llc«(B„ (r,)) • 

\a.\<m 

Then, note, 

I^IUfB^ta)) 12 ll FQc Lllco(i3„ (r,)) • 

| a | < m 

But, 
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and hence, 



and we have that: 



E 

|a| <m 



Y a d k 



%j\\C°(B no (T,)) 



<m 1 



for all fc. It follows from Theorem 13.101 that: 

\\ A \\c™(B na ( Vl yM noXno (R)) ~™ L 
And thus, we have, using the Neumann series, that: 



(I + AY 



C™(B„ (»7i);M„ oX „ (]R)) 



\I + A\ 



C m (B, lo (m);M„ oX „ 



<™ 1. 



Hence, since y„ = (I + A) S7y and Vv = (I + A) \J U , it follows easily that: 

Wf\\c™+i(B no ( vl )) ra m E ll rQ ^llc°(B„ (^)) ' 
|a|<m+l 

□ 

Now we turn our attention to showing that if we shrink r\\ enough, while 
still keeping it admissible, we have that $ is injective on B no (771). This result is 
essentially contained in ITW03] (see p. 622 of that reference), however we recre- 
ate the proof below for completeness, and to make it clear why each constant is 
admissible. Thus, the next lemma and proposition follow TW03 . 

Lemma 3.19. Suppose Z is a C 1 vector field on an open subset V C M. n , 
and U C V . Then, there exists a 5 > ; depending only on n, such that if 
ll-^llc 1 (f/) — ^> then there does not exist x\ £ U with: 

• e tz xi G U, < t < 1, 

• e z xi = x\, 

• Z( Xl )^0. 

Proof. Suppose the lemma does not hold, and we have an x\ and Z as above. 
In the proof of this lemma, we will use big-O notation-the implicit constants 
will only depend on n. Differentiating the identity: 



we obtain: 



d t7 



e tz Xl =Ol 6 



Z{e tz x Y 



dt 2 



dt 



e tz x x 
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Thus, by Gronwall's inequality 

±e tz x 1 = 
at 



d tz 



t=o 



0(\Z(xi)\) 



for t < 1. Hence, 

^6^ = 0(^1^(^)1). 

Integrating, we obtain: 

^e tz x 1 = Z(x 1 ) + 0(5\t\\Z(x 1 )\). 
Integrating again, we obtain: 

xi = e z x x = xi + Z (xi) + O (S\Z (xi)|) . 
which is impossible if 5 is sufficiently small, completing the proof. □ 

Proposition 3.20. We may shrink r\\, while still keeping it admissible, to en- 
sure that $ is infective on B no (771 ) . 

Proof. We will construct an admissible constant 772 with the properties desired 
in the statement of the proposition, and then the proof will be completed by 
renaming 772, 771 . 

Consider the maps *B Uo (u) — e u ' Y Uo, defined for |u | , |u| < 77', where 7/ > 
is some sufficiently small admissible constant. Notice, since 

ll^llc^ (B^)) ^ 1 

we have by Theorem IA.1I that $„ <E C 2 with C 2 norm admissibly bounded 
uniformly in u . Furthermore, since d^ uo (0) = (7 + A(uq)), and \\A\\ < |, we 
have that |detd\I> Uo (0)| > 1, uniformly in u . 

Hence we apply the uniform inverse function theorem fTheorem lA.3p to see 
that there exist admissible constants 772 > 0, 5 > such that for all U\,U2 £ 
Bn (772) there exists uq £ B no (S) with ui — (uq). Moreover, by shrinking 
772, we may shrink 5. 

Now suppose $ : B (772) — > B/x,d) ( x 0i0 is n °t injective. Thus, there exist 
Til, u 2 £ B no (772) , U\ 7^ u 2 such that 

$(ui) = $(u 2 ). 

But, since there exists ^ u £ B no (S) with w 2 = e u °' Y u 1: we have that: 

=$(u 2 ) = e tl °- x $(u 1 ). 

Setting Z = uq ■ X,we have by Lemma [3T81 that Z is non-zero on Bix,d) (£()>£)• 
Applying Lemma 13. 191 we see that by taking S admissibly small enough (and 
therefore 772 admissibly small enough), we achieve a contradiction. □ 
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Our proof of Theorem l3 . 1 I will now be completed by the following proposition: 

Proposition 3.21 ( NSW85], Lemma 2.16). There exists an admissible con- 
stant £i > 0, such that 

B ( x4) (*o,£i) C$(B„ (m)). 

Proof. Actually, the proof in |NSW85| proves something more general. In our 
case, though, we have already shown that $ is injective (Proposition 13. 20|) . and 
this simplifies matters, somewhat. We include this simplified proof, and refer 
the reader to |NSW85j for the stronger results. 

Fix £i > 0. Suppose y G B( X ,d) ( x o, Thus, there exists <f) : [0,1] — » 
B(x.d) Oo, <t> (0) = xo, (1) = y, 

0' (£) = (&•*) (0(i)) 



with6eL°°([o,i]r, |||er^lLoo ([0 , 1]) <i- 

Define 

T = [t < 1 : (0 € * (S„ (D) , V0 < i' < t] . 

Let to = supT. We want to show that, by taking £i admissibly small enough 
we have that t = 1 and (1) € $ (B no ( 2 §-)). 

Suppose not. Then, we must have that |<J> _1 (0(fo))| — \- Then, we have: 

|=|<J,-i(0(f o ))| 



* _1 (*(*)) 



o 
2 ' 



((6-X)$- 1 ) (0(0) 



provided £i is admissibly small enough. In the second to last line, 

(b-Y)^- 1 ^®)) 

denotes the vector b ■ Y evaluated at the point (0(i)). This achieves the 
contradiction and completes the proof. □ 



4 Carnot-Caratheodory balls at the unit scale 

In this section, we generalize Theorem l3.1l to the case when the vector fields may 
not be linearly independent-thereby completing the proof of Theorem ll.31 Sup- 
pose X = (X\, . . . ,X q ) are q C 1 vector fields with associated single-parameter 
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formal degrees d = (e?i, . . . ,d q ) € (0,oo) 9 , defined on the fixed connected open 
set n C R". Fix 1 > £ > 0, Xo G fi. Let n = dimspan{Xi (x ) ,X q (x )}. 
Fix f > ( > 0, Jo G I (no, g); we assume that: 



det X Jo (x ) 

n xn 



> C sup 

JeX(n ,q) 



det (x ) 

n Xn 



(4.1) 



Recall if Jo = (ji , . ■ . , j no ) > (-^> ^) j denotes the list with formal degrees ( (X,^ , ) , . . . , (Xj no ,dj ng }). 
We also write Xj g to denote the list of vector field Xj 1 , . . . , Xj , and dj to 
denote the list of formal degrees dj 11 . . . , dj ng . Suppose, further, that (X, d) Jo 
satisfies C(x ,£). In addition, suppose that the XjS satisfy an integrability 
condition on B(x,d) , o (a?o,£) given by: 



[X j ,X k ]=J2c l j , k Xi. 



(4.2) 



Without loss of generality, we assume for the remainder of the section that 
Jo = (1, . . . , no). We will also assume that: 

• For 1 < j < n , Xj is C 2 onB {X M), a (^0,0 and satisfies ll^ll C 2^ B(x d) {xa ^ 
oo. J ° 

. For \a\ < 2, 1 < i,j <n a ,l<k<q, G C° (x ,0), and 

< oo. 



< 



E \\ x l c U 

H<2 



C°(i 



; (x,d) J() (a:o^)J 



For H < 1, 1 < i,j,fc< g, X«c^. G C° (x ,0), and 

EM VQ fc II , 
ll^^ c i,j|lc.o(B (J[ -, d)jo (xo,0) 



OO. 



We will say that C is an admissible constant if C can be chosen to depend 
only on a fixed upper bound, d max < oo, for d\, . . . ,d q , a fixed lower bound 
dmin > for di, . . . ,d q , a fixed upper bound for n and q (and therefore for no), 
a fixed lower bound, £o > 0, for £, a fixed lower bound, Co > 0, for £, and a 
fixed upper bound for the quantities: 



Ell* 

M<2 

E 

H<i 





\ C 2 








> 


a fe 








\c° 


> 


a k 








\c° 


'b 



1 < j < "o, 

l<i,j<n Q , l<k<q, 
l<i,j,k< q. 



Furthermore, if we say that C is an m-admissible constant, we mean that in 
addition to the above, we assume that: 



39 



< oo, for every 1 < j < no, 




.jIIc°(b (JM ) Jo (so,0 



\x,4) Jo (<*>,£)) 



< oo, for every 1 < i,j < no, 1 < k < 



) 



< oo, for every 1 < i,j,k < q. 



(in particular, the above partial derivatives exist and are continuous). C is 
allowed to depend on m, all the quantities an admissible constant is allowed to 
depend on, and a fixed upper bound for the above quantities. Note that, as 
before, <o, <i, <2, and < all denote the same thing. 

For r\ > 0, a sufficiently small admissible constant, define the map: 



The main results of this section are the following: 

Theorem 4.1. There exist admissible constants rj\ > 0, £i > £2 > 0, such that: 

• $ : B na (rji) -S- B ( x,d) , o (xo,0 is one-to-one. 

• For all u <E B riQ (r]i), |det„ oX „ d$ (u)\ « |det„ oX „ X (x Q )\. 

• B {x4) (x ,&) C 5 (x , d)jo (ao.Ci) C $(B no ( m )) C B {x ,d) Jo (xo,0 C 
B(x4)j ( x o,0 Q B {x ,d) (xo,0- 

Furthermore, if we let Yj (1 < j < q) be the pullback of Xj under the map 
then we have: 



$ : B na (77) -> B [X4) (x ,0 



by 



3> (u) = exp (u ■ X Jo ) x - 



Il y illc"*(s„ ( 7?1 )) 



in particular, 




(Y 1 ,...,Y no ) = (I + A) V 



U 



then, 



sup p(u)||<~. 



Recall, we have, without loss of generality, assumed Jo = (1, ... , no). 
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Corollary 4.2. Let 171, £1, £2 ^ e as * w Theorem \4-l\ Then, there exist admissible 
constants < 772 < ?7i, < £4 < £3 < £2 sucft i/iat: 

(^0,^4) C S (X:d)/o (2:0,6) C $ (B„ (772)) 
C B (x4)ja (ar ,£ 2 ) C B {x4)jg (x 0) &) Q B {x,d) (^0,6) 
C fl (x , d)jo (x ,a) C $ (£„ (ryx)) C 5 (Xid)jo {x ,0 
Q B (x,d) Jo ( x o,0 C (z ,£) > 

and Vol (B( X ,d) (^07^2)) ~ |det„ oX „ X (xq)\, where Vol (A) denotes the induced 
Lebesgue volume on the leaf generated by the Xj s, passing through the point xo . 

Corollary 4.3. Take £4 as in Corollary \4-2\ Then, there exists <fi € Cq (^o ; 
(here, we mean C 2 as thought of as a function on the leaf), which equals 1 on 
B{x,d) ( x o, £4) and satisfies: 

^(M-i)vo 1 

for every ordered multi-index a. 

Remark 4.4. Later in the paper we will apply Corollaries 14.21 and 14.31 without 
explicitly saying what Jo and £ are. In these cases, we are choosing C — 1 and 
Jo such that: 



det X Jo (x ) 

n X n 



det X (xo) 

no X n 



Remark 4.5. In our definition of admissible constants, we have assumed greater 
regularity on Xi, . . . ,X nQ than on Xj, hq < j < q. In many applications, 
it is easier to just assume more symmetric regularity assumptions, that imply 
the assumptions of this section. Later in the paper we sometimes assume the 
following, stronger hypotheses: 

• (X, d) satisfies C (x ,£_), and (|4.2[) holds on B^ x ,d) i x o,£)- 

• In addition to everything that they are allowed to depend on in this sec- 
tion, m-admissible constants (for m > 2) can depend on a fixed upper 
bound for the quantities: 

ll^llc^x.^Oso.O) ' W Xac ^Wc«(B iX:d) (x„,o) 

where 1 < i,j,k,l < q, and these derivatives are assumed to exist, and 
the norms are assumed to be finite. Admissible constants are defined to 
be 2-admissible constants. 



Remark 4.6. Just as in Remark l3.3[ the djS do not play an essential role in this 
section. 
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Remark 4.7. As mentioned in the Section fTTTl "at the unit scale" in the title of 
this section refers to the unit scale with respect to the vector fields Xj. Thus, 
if the vector fields Xj are very small, one can think of the results in this section 
as taking place at a very small scale. 

Remark 4.8. The observant reader may have noticed that we made no a priori 
bound on Xj, no < j < q. However, we will see using Cramer's rule that (|4.1[) 
implies a bound for Xj at xq. In addition, we will be able to use Gronwall's 
inequality to obtain bounds at points other than xq (see (|4.4p ). The reader 
may wonder, though, that since we have assumed no a priori bound for the 
C 1 norm of Xj (uq < j < q), do we need to insist that they are C 1 ? The 
answer is partially no, though our definitions only makes sense when the Xj are 
all assumed to be C . We will see that the above assumptions will show that 
Xi, . . . ,X no are integrable (see Proposition I4.14| ). and from there all we need 
is that Xj (no < j < q) is C 1 on the leaf generated by X\, . . . ,X no , passing 
through xq. This perspective is taken up in Section 23] i n fact, the main reason 
we have been careful to not assume a bound on the C 1 norm of Xj (no < j < q) 
in this section, is to make clear how these arguments also work in the setup of 
Section O 

Before we prove Theorem 14.11 let us first see how it implies the two corol- 
laries. 



Proof of Corollary \4-2\ We obtain ?7i,^i,^2 from Theorem 14.11 Then, apply 
Theorem 14.11 again with £2 in place of £ to complete the proof of the first part 
of the corollary. 

By the above containments, we have: 

Vol (<f (B no (772))) < Vol (B {x>d) (x , 6)) < Vol ($ (B no (771))) . 

Using (|B.2[) and the fact that 



det d$(t) 

n xn 



det X (xo) 

no X n 



for all t S B no (?7i), the estimate on the volume follows immediately. 



□ 



Remark 4.9. By a proof similar to the one of Corollary 14. 2\ we have that if 
£' > 0, is a fixed admissible constant with £' < £ 2 , then, 



det X (xq) 

n X n 



Vol (B (Xid) (x ,e)) 

Proof of Corollary^ Let tp G C5 (B no (r/i)), with 1/3 = 1 on B no (772). Define 

I otherwise. 
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Then, we see: 



X a (j>{x) 



(y«V) (S- 1 (*)) ifze$(B„ (r?i)), 

otherwise. 



Thus, to prove the corollary, it suffices to show that: 

\Y a 1>\ < (H -i)vo 1 

and this is obvious. 



□ 



We now turn to the proof of Theorem l4.ll The main idea is to apply Theorem 
13.11 to the vector helds (X, d) jQ . 

Lemma 4.10. Fix 1 < n\ < n A q. Then, for 1 < j < no, / E T[ri\,n), 
J £l(nx,q), x e B(x,d) Jo (zo>0> 



XjdetX(x)j 



< 



det X(x) 

ni xni 



Proof. This can be proved by a simple modification of the proof for Lemma [ 
We leave the details to the reader. □ 

Lemma 4.11. For y E B( X ,d) jQ {xo,0> 1 < rii < g An, 



In 



Proof. This can be proved by a simple modification of the proof of Lemma 13.81 
using Lemma 14.101 We leave the details to the reader. □ 



det X(y) 




det X (xo) 


n\ xni 







particular, for all y E B( X ,d)j (xo,Q, dimspan{Xi (y) , . . . X q (y)} = n . 



Take Iq E T (uq, n) such that: 
detX(x ) IoJo 



sup 



detX (xq)j 



Jo 



Lemma 4.12. There exists an admissible constant £ x > 0, £ 1 < £ such that for 
every y E B {x ,d), o (^o,^ 1 ), we have: 



> 



det X{y) 

n xn 



detx (y)i ,j 

Proof. Fix / el(n ,n), J el(n ,q). Let 7 : [0, 1] -> B {x ,d) Ja (xo,£) satisfy: 

y(t) = (ft-x Jo )( 7 (t)) 



with 6 £ L°° ([0,1])"°, 



Il°°([0,1]) 



< 1. 
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By applying Lemmas 14.101 14.111 we see: 

j t |det Xj.j ( 7 (t))\ 2 = det X 7:J ( 7 (t)) ((6 • X Jo ) det Xj.j) ( 7 (t)) 



< 



det X( 7 (t)) 

n Xn 



det X (xo) 

ri xn 



det X (xn) r 



detX(x ) /o Jo 



and therefore, 



l\detXu^(t))\ 2 <c|detX(x ) /oiJo 



where C is some admissible constant. Thus, if t < ^y, we have 



and, 



detX(j(t)) Io Jo a detX(x ) /n , Jn 



detX( 7 (t)) J J < detX(a;o) J)J + det X (x ) IoJo 



< 



detX(xo) IoJo 



(4.3) 



We complete the proof by noting that there exists an admissible constant £ x > 
such that for every point y G (^OiC 1 ) there is a 7 of the above form 

and a t < with y — -f (t). 

Lemma 4.13. Fix I E X (n ,n), J E X (n ,q). Then, 



□ 



| a. | <m 



a detX^j 



Jo 



detX 



Io,Jo 



Proof. For m = this follows from Lemma 14.121 For m > 0, we look back to 
the proof of Lemma l3.6l There, it was shown that Xj det Xi t j could be written 
as a sum of terms of the form 

fdetXrj, 

where I' € I (no, n), J' € I (no, q), and / was either of the form c ^ ? - or / was 
a derivative of a coefficient of Xj (1 < j < no). From this, Lemma T4. 121 and a 
simple induction, the lemma follows easily. We leave the proof to the interested 
reader. □ 
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We now show that on B( X ,d) 7 (^o^ 1 ), the vector fields Xi, . . . ,X no sat- 
isfy the hypotheses of Theorem 13.11 Recall, we have assumed, without loss of 
generality, J = (1, . . . , n ). 

Proposition 4.14. Fori < i,j,k < uq, there exist functions j G C \ B(x,d)j { x 0i£ l 
such that, for 1 < i,j < hq: 



[x i ,x j ] = '£cl j x k . 



k=l 



These functions satisfy: 



Ell Y a ^11 < 1 

\\ A Jo C ij\\co(B lXii) j (so,?)) - m 

| a | < m 

Proof. For 1 < j, k < q, let X^' k ' be the matrix obtained by replacing the jth 
column of the matrix X with Xf., Note that: 

detX io k j = £j,kdetX IoJUM 

where tj.k € {0,1,-1}, and J(j,k) £ T(no,q). Thus, for any 1 < fc < q, we 
may write, by Cramer's rule: 



detX 



(i,fc) 

Iq,Jq 



detX/ 0:Jo ~ 
Hence, we have, for 1 < i,j < no: 



detXj 0] j(; ;fc ) 
detX/ 0jJo 



(4.4) 



pwu =E4^ - E E*^^|§ff * - E 

fc=l 1=1 Vfe=l aetA Wo / ;=1 



4^- 



Given the form of , the desired estimates on the derivatives follow imme- 

□ 



diately from Lemma 14.131 



We now apply Theorem 13.11 to the list of vector fields (X, d) Jg on the ball 
B(x,d) J(j (^OjC 1 )- We obtain £i and rji as in the statement of Theorem 14. II We 



obtain 



But, we know from our initial assumptions that 



det d$(i) 




det X Jo (x ) 


no xno 




110X110 



det X Jo (x a ) 




det X (xo) 


n xn 




n xn 



For 1 < j < no, we have: 

ll y jllc-(B» ( m )) ~ m L 

Hence, to complete the proof of Theorem 14. 11 we need to show the existence of 
£2 and prove the estimates on Yj for no < j < q. We begin with the latter: 
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Proposition 4.15. I|i / fe|lc«(s„ (^ 1 )) ~"i 1, /or n < k < q. 
Proof. By (|4.4p . we see that we may write: 

'detXr 



1=1 



^I ,J(l,k) 



detX IoJg 

Since we already know the result for Yi, 1 < I < no, it suffices to show that: 
det X/ 0i j(; :fc ) 



detX Io . Jo 



o $ 



< m 1. 



C m (B„ ( m )) 



By Proposition 13. 181 it suffices to show that for \a\ < m, 



Y' T 



det X 



i ,J{i,k) 



But, 



Y 



J °~ detX IoJo 
detXr 



o <l> 



< i 



^i ,j(i,k) 



J ° detX Io , Jo 



o $ 



C°(B„ (>Ji)) 

„ detX/ 0i j (i!fc) 



o $. 



LJo " detX /o , Jo 

From here, the result follows immediately from an application of Lemma 14.131 

□ 

We now conclude our proof of Theorem 14. 1[ with the following proposition: 
Proposition 4.16. There exists an admissible constant £2 > such that: 
B (x,d) (^0,6) C B { x4) Ja (xo,£i)- 

Proof. Suppose y £ B( X ,d) (%o, £2), where £2 < £1 < will be chosen at the end 
of the proof. Thus there exists a path 7 : [0, 1] — » B( X .d) ( x o,&), 7(0) = x , 

7(1) = y, 

7' (t) = (&•*) (7 («)) 
where b £ L°° ([0,1]) 9 with |||£r d& ||| 
have: 

V (*)=£> (*)** (tW) 



L°°([0,1]) 



< 1. Then, applying (|4.4p. we 



fc=i 

no f q 



(*) 



:=i \fc=i 



dctX( 7 (f)) Jo , J( ,, fc) 
detX( 7 (t)) 7oiJo 



^i(7(*)) 



=:^a< (*)JT I ( 7 (t)), 
i=i 

and if £2 > is admissibly small enough, by Lemma 14.121 we have that: 

< 1, 



-2d, 1 i2 
04 



proving that y = 7(1) £ #(x,d}, 0o,£i)- 



L°°([0,1]) 



□ 
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4.1 Control of vector fields 



We take all the same notation as in Section @] and define (m-)admissible con- 
stants in the same wavF^I The goal of this section is to understand when we can 
add an additional vector field with a formal degree (X q+ i,d q+ i) {d q+ i G (0, oo)) 
to the list of vector fields (X, d) without "adding anything new." In particular, 
we wish to not significantly increase the size of Bix,d) (xq,t), where r is thought 
of as a fixed constant < £. 

Let X q+ i be a C 1 vector field on B(x.d)j { x o,£) (here we mean that X q+ \ 
is C 1 thought of as a function on the leaf in which Bi x .d) ( X Q,0 nes ; but it 
need not be tangent to the leaf), and assign to it a formal degree d q+ i £ (0, oo). 
Let (^X, dj denote the list of vector fields with formal degrees: 

{(Xi,di) , . . . , (X q+ i,d q+ i)) . 

For an integer m > 1 we define three conditions which will turn out to be 
equivalent (all parameters below are considered to be elements of (0, oo)): 



1. TTOsi.Ti.tri.ffP): 

• |det„ x„ X (£Co ) ! oo > Kl 



det T . 



X(x ) 



det jxj X (x ) 



= 0, n < j < n. 



There exist c l iq+1 G C° (B^ x ,d)j (£o>ti)) such that 



9+1 



[X u X q+1 ] = 4, q +i x i> on B (x,d) Jo («o,Tt) 
j'=i 



with: 



E p a 4+i 

\a\ <rn— 1 



C°(B ( x,d) Jo (xo,ri)J 



H,q+1 



C(B ( x,d) J(J (a:o,n) 



2. (t 2 ,(t 2 ,ct™): There exist c 3 G C° (-B(x,d) Jfl (^o,t 2 )) such that: 







* ^-/|a|<m 









■II <f /T m 

jll C"(B(x.*) Jo (*o,H.)) ^ • 



3. Vf (t 3 ,o-3,4™): There exist c 3 G C° [b (X 4) Jq {x ,t 3 )\ such that: 



'We are still assuming Jo = (1, . . . , no). 
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• E| a |< TO ll^ Q Cill C o( B(X(i)jo(x0)T3) ) <<>?• 

Theorem 4.17. Vf => => =>■ Vf in the following sense: 

1. V™ (fti, ti, <7i, (Tj™) =>• i/iere exisi admissible constants r 2 = T2 (ki,ti,cti), 
C2 = 02 (ki,(7i), and an m- admissible constant a™ — a T smc/i 
thatP^(T 2 ,a 2 ,cr^). 

2. V? (T 2 , <X 2 , CT2 m ) P 3 ™ (r 2 , <7 2 , 

5. 'P3 1 (73, (T3, ct™) =>■ £/iere srisi admissible constants «i = «i (03), cri = 
(Ti (03) and an m-admissible constant a" 1 = tr™ (f™), such that V™ 73, a±, 

Proof. V™ =>■ "P™ follows just as in the proof of (|4.4|) . This can be seen by 

noting that (X,d) can be replaced by ^X,otj in the proofs of Lemmas 14.101 

14.111 14.121 and 14.131 The reader might worry that in the definition of V™ we 
are using X a instead of XJ g ; however, there is no real difference between the 
two, due to (|4.4I) . From there, the proof follows easily, and we leave the details 
to the interested reader. V™ => Vf and Vf => V[ n are both trivial. □ 

Let d^ +1 be a fixed lower bound for d q +\. We have: 

Proposition 4.18. Suppose V\ [r%, <r 2 , c 2 ) holds. Then, there exists an admis- 
sible constant t' — r' (d^ +1 , 12, <r 2 ) such that: 

B (x.d) Oo,i~') C Bt%^ (x q ,t') C B ( x4) Jo (x ,t 2 ) . 

Proof. The first containment is trivial. The second follows just as in the proof 
of Proposition l4~T6l □ 

Proposition 4.19. Suppose V™ ( T 2, 02, cr™) holds. Let rf < r\\ be small enough 
that $ (B„ (?/)) C B(x,d)j ( x 0t t 2)- Let Y q+ \ be the pullback of X q+ \ under $ 
to B no (rf). Then, 

where cr™ = cr™ (cr™) zs an m-admissible constant. 

Proof. This follows just as in Proposition ^. 151 □ 

Remark 4.20. Our assumption on the commutator in Section 0] was 

essentially just that ([Xi,Xj] ,di + dj) satisfied condition V™ for appropriate 
m. 
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4.2 Unit operators at the unit scale 

In this section, we study the compositions of certain "unit operators," which 
will be the core of our study of maximal functions in Section [B]-see Section H.2.41 
for some motivation for the study of these operators. 

Let X\, . . . ,X g , di, ■ ■ ■ ,d q , xq, £, £, q 5 ,-, no, and Jo be as in Section [U in 
addition (for simplicity), we assume the stronger assumptions of Remark 14.51 
We again suppose, without loss of generality, that Jo = (1, . . . , no). In addition, 
suppose we are given v subsets of {{X\, d±) , . . . , (X q , d q )}: 

{(zf, <),..., (zi , dl) } C di) , . . . , (X q , d q )} 

with 1 < fi < v. Suppose these subsets satisfy: 

{(X 1 ,d 1 ),... t (X no ,d no )}Q |J {(Zf,^),---,(^,<)}- (4-5) 

We say C is a pre-admissible constant if C can be chosen to depend only on 
those parameters an admissible constant could depend on in Remark 14. 5 [ plus 
a fixed upper bound for v. We will write A ^ B for A < CB where C is a 
pre-admissible constant. Also, we write A ~ B for A ^ B and B ^ A. 

If we say that C is an admissible constant, it means that we furthermore 
assume that: 

fc=i 

and C is allowed to depend on everything a pre-admissible constant is allowed 
to depend on, plus a fixed upper bound for the quantities: 

y Ik^T^f , N , i <v<v 

\trJ j °°(w*b.0) 

which we assume to exist and are finite. 

Given a function / defined on a set U, and given for each x G U a set V x , 
we define for those y £U such that V y C U: 

Here we are being ambiguous about what we mean by Vol (V y ) and dz. Below, 
V will be replaced by sets lying in the leaf generated by one of the Z^s (or by 
X). We then mean for Vol (•) and dz to refer to the Lcbesgue measure on that 
leaf. Below, we will drop the U from the subscript Au,v, > and it is understood 
to be the domain of /. 

If we let £2 be as in the statement of Corollary 14. jpl the main result of this 
section is: 

23 Note that all of the constants in Corollary 14.21 are pre-admissible in the sense of this 
section. 
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Theorem 4.21. There exist admissible constants < A3,A2, Ai < £2 such that 
for every f 6 C° (B(x,d) (xo,£,)) with f >0, we have: 

A B {x , d) (-M)f ( x o) ^ A B{z „ dni . M) A B{zu ^ l id „_ 1)( .,A 2 ) • • • ^b (z1 dl) (-,A 2 )/ fao) 



< A 



Define n% = ^2a=i 1v ^° P rove Theorem 14.2 11 we need a preliminary result: 

Proposition 4.22. There exist pre- admissible constants < h,l2,h < £2 such 
that for all f > 0, we have: 



A B (x , d) (-,h)f(x Q ) 



-< 



f (e Ul - zl e U2 - z2 ■ ■ ■ e u "- zv x ) dm . ..du v du v 

1O2) V 

3 A B (x , d) (-,h)fM- 



Recall, Q ni (I2) denotes the |-| 6a/Z m M™ 1 of radius l 2 - 

To prove Proposition 14.221 we need some preliminary results. Set l\ = £3, 
take 772 and $ as in Corollary 14.21 

Lemma 4.23. For f > 0, 

/ o $ (u) du < A B{X d){ . M f{x )- 
Proof. Note that: 

Vol($(B„ fo))) 
and so we have: 

f 



det X(xo) 

no xn 



Vol(B (Xid) (10, Ai)) 



f {y) dy £ A B ( . th) f (x ) ■ 



Vol (772))) y$(B„ Q (n 2 )) 

Now applying a change of variables as in (|B.2|) and using that by Theorem 14.11 
for every u € B m (n 2 ), 



det d$ (u) 

n xn 



det X (xq) 

n xra 



we see that: 



B no (m) 



Vol (B (Xjd) (x ,k)) 
/o$(u) du^i4 B(Jti(0 (. ) j 1 )/(«o) 



completing the proof. 



□ 



Let Y^ 1 be the pullback of under the map $. As before, we let Y\ , . . . , K, 
denote the pullbacks of Xi , . . . , X„ . Note that, by (14.51) each of Y\ , . . . , Y, 
appears as at least one of the F/\ 
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Lemma 4.24. There exists pre- admissible constants I2 > 0, 773 > such that 
for all / G C° (B no (%)), / > 0, 



f{u)du< [ f (e Ul Yl e U2 Y2 ■■■e u » Y "o)dui---du v 



(4.6) 



/ (u) du. 

B no (m) 



Proof. Let (u±, . . .u v ) denote the map: 

*(ui,...,u v ) = e Ul - Y± e 112 ^ 2 ■■■e u » Y "0. 

Note that ^ G C 2 (Q ni (77')), provided 77' is a sufficiently small pre-admissible 
constant. Moreover, the C 2 norm is bounded by a pre-admissible constant (by 
Theorem [OJ using that II Y M II , nN ^ 1, by Theorem O). 

Recalling that each YJ- (1 < j < no) appears at least once in some Y£ , for 
each 1 < j ' < hq we pick one such occurrence. Write $ as a function of two 
variables: 

W (u\ u 2 ) , u 1 € Q no (77') , u 2 G <3 ni -no («') 

where tj 1 denotes the coefficients of the above chosen K,-, and u 2 denotes the 
remaining coefficients. For each fixed u 2 , think of as a function of one variable: 

Note d^o (0) = and so by the C 2 estimates of we see that if I2 if a pre- 
admissible constant that is small enough, for every u 2 G Q ni _„ (I2), we have: 

\\dS!*{Q)-I\\<\. (4.7) 



Hence, by the inverse function theorem (Theorem IA.3|) , we may pre-admissibly 
shrink I2 such that: 

• For every u 2 G Q ni -n a (h), *u 2 is injective on Q rlg (l 2 ). 

• (Q no (l 2 )) C Q„ (772), for every u 2 G Q ni -no (fc). 

Hence, by a simple change of variables, we have for u 2 G Q ni - no (^) : 

for / > 0. Applying: 

cfc 2 



^m-no('a) 

to both sides of this expression proves the latter inequality in (14.6[) . 
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We now turn to the former inequality in (|4.6p . Applying the inverse function 
theorem (Theorem IA.3|) and again using (|4.7[) we have that there exist pre- 
admissible constants rj' < I2 and 773 < I2 such that for every u 2 G Q ni -n (t)') 
we have that: 

B no {Vs) C *„2 (Q„ (Z 2 )) . 

Thus a simple change of variables (using (|4.7p ) shows that, for w 2 e Q ni -n (ff) 
and / > 0: 



/ (m) du 



-< 



Integrating both sides in it 2 , we obtain 



/ («) du 



-< 



^0(7)3) 



/ (*„2 (u 1 )) du x du 2 . 



Where in the last line, we used that / > and that 77' < 1%. This proves the 
first inequality in (|4.6|) and completes the proof. □ 

Lemma 4.25. There exists a pre- admissible constant I3 > smc/i i/iai /or aZZ 
/ > 0: 

A B (Xd) (;i 3 )f(x ) £ / /o$(u)dtt. 

Proof. Proceeding as in the proofs of Propositions 13.211 and 14.161 we may find 
a pre-admissible constant I3 > such that: 

B(XA) (X0,h) ^®{Bn (»»))■ 

Note that, by Remark 14.91 we have: 



Vol (B (x4) (x ,h)) 
and it follows that 

A B {XA) (-.i 3 )f(xa) 3 



det X (ceo) 

noxno 



Vol (773))) 



/ (y) dy. 



voi($(B no (773))) y$( B „ (, )3 )) 



Applying a change of variables as in (|B.2|) and using that for all u € B na (773), 
we have: 



det d$(f) 

n xra 



det X(xo) 

no x n 



it follows that: 



yl B(x.d)(-,i3)/( a; o) 



-< 



Vol($(B„ (773))) 
/ («) du 



completing the proof. 



□ 
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Proof of Proposition \4-22\ Fix / > as in the statement of Proposition 14.221 
Apply Lemmas 14.231 14.251 to / and Lemma 14.241 to / o $ to obtain: 

A b (x d) (-.i 3 )f (*o) 3 / f ° * (e UlF V 2 - F2 • • • e u " Y "Q) dm... du v du v 
Using that: 

/ o $ (>^>^ 2 . . . e^ Y "Q\ = f (>^>^ 2 . . . e ^- z " Xo ] 

completes the proof. □ 
Proof of Theorem \4- 2 1\ Let < £' < £2 be an admissible constant so small that: 

where B denotes that A is a relatively compact subset of B. It is easy to 
see that this is possible, and we leave the details to the reader. Further, we take 
< £" < £' to be an admissible constant so small that for every y £ S7o, 

B (Zf,d») (V, £") <s B (x,d) (xo,0: 1 < A 1 < ^ 

We apply Proposition 14.221 to each y £ J7o with £" in place of £ and (Z^,d^) 
in place of (X, d) (and taking ^ = 1) to find admissible constants h,l2,h such 
that for every y £ fio, and every / > 

A B ^, dn (;i 3 )f(y)< f f(e u ^y)du, 

JQ^(h) v 7 (4.8) 

< ^B (zf .,^)(-,ii)/(2/) 
and also applying Proposition 14.221 as it is stated we may ensure that: 

A 

Let Ai = 1% and A2 = ^3. Then, applying (|4.8p ^ times, we see that: 

4 v,*')(-.*j) ;1 v- 1 .' i "- 1 ) c ' ,As) ' ' ' ^ B (2Mi)(-' A 2)/ 

< f f (e ul Zl e U2 z2 ■■■e Vt ' zu xo)dui...du u . 



b (x <,(.,»,)/ (*o) < / / (e^V-* 2 • • • e^ z '^ ) dm . . . du v du v 

JQndh) V 7 (4.9) 
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Applying (|4.9I) yields the second inequality in the statement of Theorem 14.211 
We apply Proposition [422] to each y £ Sl with A2 in place of £ and (Z M , d M ) 
in place of (X,d) (and taking v = 1) to find admissible constants l' 3 ,l' 2 ,l'i < A 2 
such that for every y £ CIq, and every / > 0: 

B *»"*w w ~ U^y \ > " (4.10) 

<^s ( z,^,(-r 1 )/(y) 

and also applying Proposition ^. 221 as it is stated (with A2 in place of £) we may 
ensure that: 

A Bix (. v) f(x a )< f f(e^ zl e^ z2 ---e u " z \ )du 1 ...du v du v 



< A 



B(x,,)(-,ii) /(a;0) 



(4.11) 



Set A3 = 1' 3 . We first claim that, for all / > 0: 

A B {z », dn {. A )f(y)^ A B [z ». dn (-M)f(y)' y^o, \<ii<v. (4.12) 

Indeed, we already have that l[ < A 2 . Moreover, we have by Remark 14.91 

Vol (B( Z »,d») (y, ~ Vo1 (B(zn,d») (y, A 2 )) , y £tt 

and (|4.12p immediately follows. 
Thus we have: 

> [ f(e UlZl e U2z2 ---e u » z "x a \du l ...du u du u 

£ ^B(x.d)(-.A,)/(aS0) 

where in the second to last line, we have applied (|4.10[) f times, and in the last 
line we have applied (|4.11[) . This completes the proof. □ 



5 Multi-parameter Carnot-Caratheodory balls 

In this section, we discuss multi-parameter Carnot-Caratheodory balls. In Sec- 
tion 15.11 we state the main theorem regarding multi-parameter balls (Theorem 
15. 3p . In Section I5~2l we discuss four examples/applications where Theorem 15.31 
applies, one of which is the "weakly-comparable" balls of [TW03 . Finally, in 
Section [5731 we discuss a notion of "controlling" vector fields, which we hope will 
elucidate the complicated assumptions in Section [5. II 
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Before we begin, we need one new piece of notation. Suppose we are given 
formal degrees d\, . . . , d q G [0, oo)". If a is an ordered multi- index, we define 
the formal degree 

i=i 

where kj denotes the number of times that j appears in the list a. Thus if 
6 G [0, oo)", we may define 5 d ^ £ [0,oo) and 5- d ^ e [0, oo] in the usual way. 

5.1 The main theorem 

Suppose Xi, ... ,X q are q C 1 vector fields with associated formal degrees ^ 
di,...,d q G [0,00)". Let K C (think of K = {xq} or, more generally, K 
compact). Suppose that £ G (0, 1]" is such that (X, d) satisfies C (x, £), for every 
x G K. The goal in this section is to apply Theorem 14.11 and Corollaries 14.21 
and 14.31 to the vector fields (SX, d) at each point x G K, where 5 G [0, l) v is 
small. 

Fix a subset A: 

AC{6e [0,1]" : 5^0,S < £} 

to be the set of "allowable" 5s. Recall, S < £ means that the inequality holds 
coordinatewise. 

Remark 5.1. We will be restricting our attention to balls Br x ,d) ( x i$)i where 
5 G A., x G K. For many applications, one would take: 

A = {8e[0 ) l] u :5^0 ) 8<t} (5.1) 

and we encourage the reader to keep this particular choice of A in mind through- 
out this section. However, other choices of A do arise in applications. For 
instance, the choice: 

A = {5 G [0, 1]" : S ^ 0, S < £, 6x > S 2 > ■ ■ ■ > S u } (5.2) 

arises in the study of flag kernels, as in |NRS01j . Also, the results in Section 
15.2.11 use yet another choice of A. 

In this section, we assume that for every 5 G A, x G K, we have: 

k 

on B(x.d) { x ) 8). In addition, we assume: 

• The XjS are C 2 on B^ x ,d) (x, £), f° r every x G K , and satisfy sup x£if ll^jllc 2 (B (x d) (x £)) < 
00. 

• For all \a\ < 2, x G K, we have (S d X) a c k j' x G C° (B (x , d ) (x,6j), for 
every k, and every S G A, and moreover: 

sup V \\{5 d X) a c k t f x f <oo. 
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Finally, let 



n (x,S) = dimspan{<5 dl Xi (x) , . . .,S dq X q (x)}. 



We say C is an admissible constant if C can be chosen to depend only on 
fixed upper and lower bounds d max < oo, d m i n > 0, for the coordinates of d, 
a fixed upper bound for n, q, v and a fixed upper bound for the quantities: 



sup \X 

x£K 



sup J2 

x£K M< 2 



\(S d X 



.<* k,5,i 
1 C iJ 



C°(B (Xtd) (x,5)) 



Furthermore, if we say C is an m- admissible constant, we mean that in addition 
to the above, we assume that: 



• su Px£K \\ x j\\ c ™(B (Xtd) (x,z)) < °°> for ever y 1 - j - 1- 



sup 5ed Eu|< m (5 d X 



xeK 



) C <3 



C(s (x , d) (x,5)) 



< oo, for every i, j, k. 



(in particular, the above partial derivatives exist and are continuous). C is 
allowed to depend on m, all the quantities an admissible constant is allowed to 
depend on, and a fixed upper bound for the above two quantities. 

Remark 5.2. The assumptions in this section are somewhat complicated. The 
reader might hope that special cases of these assumptions might be enough for 
applications. Unfortunately, this seems to not be the case, and is discussed in 
Section 15.31 

For each 5 € A, x € K, let J(x,5) = (x, 5) 1 , . . . , J (x, S) na ^ ^ € 
I (no (x, 5) , q) be such that: 



det (S d X(x)) 

no [x,o) X no {x,o) 



J(x,S) 



det 6 d X (x) 

no(x 1 5)xno(x,5) 



and define, for u <G R™°( ;r > (5 ) with |w| sufficiently small: 

, , u-(5 d X) 

®x,s (u) — e y 'H*J)x. 
The main result of this section is: 

Theorem 5.3. There exist admissible constants r]\,r\2 > 0, < £4 < £3 < £2 < 
£1 such that, for all S € A, x G K : 

B(x,d) {x,&8) C B( Xtd)j(xS) (x,£ 3 5) C (B„ ( XjS ) ("2)) 
C B(x,d) Hx 5) (x,&5) C B {X4)j{x 5) (x,&5) C S( Xid) (x,£ 2 <5) 
C B {x ,d) J(X:S) {x,£i6) C ^ (B no{x , s) (771)) C B( X ,d) J{w <s) ( x > s ) 
Q B(x,d) J( „ iS) (x, 5) C S (Xid) (x, (5) , 



and 
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• $x,s ■ B no{x j) (771) -> B( X ,d) J(x i} (as, 5) is one-to-one. 

• Forallu £ B no ( x , 5 ) (171), jdet^^^jx^^) d$ x>5 ps |det no ^(z) 

• Vol (-B(x,d) (z> 6*)) ~ |det no ( X)5)x „ ( ;C)5 ) <5 d X (a;)|. 

• There exists <j) Xt s £ Cg (-Bpt ,d) O^j ^)); which equals 1 on Br X ,d) ( x i £4^) 
and satisfies: 



Furthermore, if we let Y^' S be the pullback of S dj Xj under the map $> x 
B n (x,S) (m), ™ have that: 



to 



< m 1. 



Finally, if for each x £ K , u £ B no r Xt g\ (r)±), and S € A, we define the uq (x, 5) x 
no (x, 5) matrix A (x, u) by: 



then, 



sup 

«eB„ (.,f)(n) 



||^(a:,«)|| < 



(5.3) 



Proof. For each x £ K and 5 £ A, merely apply Theorem l4 . 1 1 and Corollarics l4.2l 
and 14.31 to (5 d X, Y^d), taking £ = 1 and Jo = J(x,5). It is easy to see, by the 
assumptions in this section, that all of the constants admissible (respectively, 
m-admissible) in those results are admissible (respectively, m-admissible) in the 
sense of this section. □ 

Corollary 5.4. We assume, in addition to the other assumptions in this sec- 
tion, that for every S £ A with \5\ sufficiently small, 5 £ A (in particular, 
this is true if A is given by H5.1\) or 115.2]) ). We have, for x £ K , and all 5 £ A 
with \5\ sufficiently small: 



Vol {B {XA) (x,S)) 



det (^6) X(x) 

no(i)xn (i) 



det S d X (x) 

n (x)xn (x) 



(5.4) 



(5.5) 



and so if \ 5 \ is sufficiently small and 25 £ A, 

Vol (B (Xtd) (x, 25)) < Vol (B ixM) (x, 5)) . 

Proof. (|5.4p follows by replacing 5 with ^ 1 <5 in the statement of Theorem 15.31 
(15. 5|) follows since the RHS of (|5.4p is the square root of a polynomial in 5 (with 
positive coefficients). □ 
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5.2 Applications and examples 

In this section, we present four applications/examples where Theorem 15.31 ap- 
plies. The first two applications were both previously well understood, and in 
fact both can be understood by the methods of |NSW85j . The reason we in- 
clude them here is to put them in the context of Theorem 15 . 31 and because 
they have been useful in the past. The third example is included to provide a 
simple situation where the methods of [NSW85) do not apply but Theorem 15.31 
does. We close this section with most interesting of our applications. In this 
application, we show how to lift results from the single parameter case to the 
multi-parameter case. In particular, we will see how results like the Campbell- 
Hausdorff formula can be applied even in the multi-parameter case-where, at 
first glance, they seem totally inapplicable. 

5.2.1 Weakly comparable balls 

In this section, we discuss the so-called "weakly-comparable" balls that were 
used in ITW03j . We do not attempt to proceed in the greatest possible gener- 
ality, and instead just try to present the main ideas. Most of the conclusions of 
this section are contained in T\\ 1)3!. and the main purpose here is just to show 
how these results are a special case of Theorem 15.31 

Let X\, . . . ,Xu be v C°° vector fields defined on f2, with associated formal 
degrees d\,...,d v € (0, oo). Fix large constants k,N. Essentially, we will be 
considering the balls generated by the vector fields 5^" , where we restrict our 
attention to those 6 = (5i,...,8 v ) such that: 

C < kS^ (5.6) 

for every /zi,/Z2- We call a 6 satisfying (|5 ,6[) a "weakly comparable" 5. 

We assume that Xi,...,X v satisfy Hormander's condition. That is, X\, . . . , X, 
along with their commutators of all orders, span that tangent space at every 
point of fi. Fix K <g fi, a compact subset of Q, and let f2o <£ £2 be such that 

K <E 

Let € [0, oo)" be the vector that is d^ in the /ith component, and in the 
other components . For a list (or a "word" ) w = (wi , . . . , w r ) of integers 1 , . . . , v 
we define: 

r 

d {w) = ^2 dwj , 

3 = 1 

X w = ad (X Wl ) ad (X W2 ) ■ ■ • ad (X Wr _ 1 ) X Wr . 

As before, for a v vector e = (ei, ...,e v ) £ [0, oo)", define S e = Yl^ =1 tiff ■ 

By the assumption that X\, . . . ,X U satisfy Hormander's condition, and by 
the relative compactness of fio, there exist I lists w 1 , . . . , w l such that, for every 
x G Q.q: 

T X Q = span{X TO i (x) X w i (x)}. 
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Let d = sup 1<m<i d{w 



Recall, 



E 



Let (X, d) denote 



the finite list of vector fields along with associated formal degrees given by 



< Nd . 



(^X w ,d(w)^j , where w ranges over all lists satisfying d(w) 

Take £ € (0, l] u so small that (X, d) satisfies C (x,£) for every x € K, with 
fio taking the place of fi in the definition of C (x, £). 

In this section, we say that C is an admissible constant if C can be chosen 
to depend only on a fixed upper bound for n, a fixed upper bound for v, a fixed 
upper bound for N and k, fixed upper and lower bounds for d^ (1 < \i < v), a 
fixed upper bound for do, a fixed lower bound for: 



inf 



det (X w i 0) | • • • \X w i (x)) 



and fixed upper bounds for a finite number of the norms: 



\X 



MllC""(f2 ) ' 



1 < fj, < V. 



Theorem 5.5. Let A = {S e [0, If : 8 ^ 0, 8 < £, 8% < k8^ , V>i, M2} ■ Then, 
with this choice of A, the list of vector fields (X, d) satisfies the assumptions of 
Section \5.1[ where all of the constants that are admissible (or even m-admissible) 
in the sense of that section are admissible in the sense of this section. Hence, 
Theorem 15.31 holds for (X, d) . 



Proof. We will show that if W\ and w 2 are words with 
we have for 8 £ A: 



d(wi) ^ , d{w 2 ) 



<Nd , 



5 d< - w ^X Wl ,5 diw2) X W2 



with 



If 



\c W3 ' S I < 1 

rW!,W 2 \\C m (Q ) ~ 



d(wi) + d{w 2 ) 



< Ndo, this follows easily from the Jacobi identity. We 



proceed, then, in the case when 



d(wi) + d(w 2 ) 



> Ndo. Using that: 



(5.7) 



with 



fe=i 



\c k I < 1 



and multiplying both sides of (|5.7p by: 
the result follows easily. 



□ 
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Example 5.6. An example to keep in mind where the weakly comparable hy- 
pothesis is necessary is given by the following vector fields with formal degrees 
on M 2 : 

(d x , (1,0,0)), (e-^d y , (0,1,0)), (d y , (0,0,1)). 

If we restrict our attention to the case when S3 = 0, Si = <5 2 (which is impossible 
under the weakly comparable hypothesis, without taking Si = = 62) then 
(without being precise about definitions), we are left with the one-parameter 
ball of radius Si "generated" by the vector fields: 

d x , d y 

and it is well known that this sort of ball cannot satisfy any sort of doubling 
condition of the form f|5.5[) . 

5.2.2 Multiple lists that span 

In this section, we suppose we have v lists of C°° vector fields on ft C R" with 
associated formal degrees: 

(Xf,<) , . . . , ,d? € (0,oo),l < » < v 

and we assume that for each [i, the list 

spans the tangent space at each point in f2. Our goal is to consider the balls 
generated by the vector fields: 

spx?, i<n<v, i<i<? M , 

where 5 = (Si, ... , 5 V ) is small. 

Fix K (e H, a compact subset of fl, and take J7o d fi such that K <g £lo- 
Define: 

do := max 

1<?<9„ 

We define dj £ [0, 00)^ for 1 < \i < v and 1 < j < to be the vector 
that is d^ 1 in the /zth component and in all the other components. For a list 
w = ((wi, fii) , . . . , (w r , |U r )) of pairs, where 1 < < v and 1 < uij < we 
define (as in Section [523} : 

r 

i=i 

X„ = ad (X£) ad • • • ad (x^) X£. 
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Let (X, d) denote the list of vector fields with associated formal degrees given 



d(w) 



< da 



by (^X w , d(w)j where w ranges over all those lists with 

Take £ € (0, 1]" so small that (X, d) satisfies C (x, £) for every x £ K, with 
f^o taking the place of f2 in the definition of C (x, £). 

In this section, we say that C is an admissible constant if C can be chosen 
to depend only on a fixed upper bound for n, fixed upper and lower bounds for 
dj (1 < n < v, 1 < j < a fixed upper bound for v, a fixed lower bound for: 



inf 

l<H<v 



det (X* (x) | • • • |X£ (x) 



and fixed upper bounds for a finite number of the norms: 

1 < M < v , 1 < J < %• 



I J llc m (f2o) 



Theorem 5.7. Lei .4. &e given by \5.1\) . Then, with this choice of A, the list 
of vector fields (X, d) satisfies the assumptions of Section 15. jj where all of the 
constants that are admissible (or even m- admissible) in the sense of that section 
are admissible in the sense of this section. Hence, Theorem \5.3\ holds for (X, d) . 



Proof. We will show that if W\ and Wi are lists with 
we have for 5 £ A: 



5 i(w ^X Wl ,8 A ^X w , 



E 



(w 3 ) Xi 



U>3 ! 



< d n 



(5.8) 



with 



If 



n 1>>3,& 



5, 1. 



d (wi) + d (W2) 



\rw!,w u iic m (n ) 
< do, ()5.8j) follows easily from the Jacobi identity. We 



proceed, therefore, in the case when 



d(wi) + d(w 2 ) 



> do- Let us assume 



that the /ith coordinate of d (wi) + d (w 2 ) is greater than d Q . Using that: 



\X WX ,X W ^\ 7 c: 



3 Yf 

/ j X ~"W\ ,11' 2 j ' 

3=1 



(5.9) 



and multiplying both sides of 



by: 



follows easily. 



□ 



Remark 5.8. Theorem 15.71 also follows from the results in Section 4 of [Str08 
(which used the methods of [NSW85] ). In fact, the more general results in 
Section 4 of |Str08j are clearly a special case Theorem [ 
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5.2.3 An example where the methods of [NSW85] do not apply 

As was already discussed in Section [T.2.11 the methods of [NSW85] fail to prove 
Theorem 15.31 The main issue is that the error term given by the Campbell- 
Hausdorff formula cannot be a priori controlled using the methods of [NSW85 
(see Section lT.2.ip . Thus, if one wishes to develop an example where the methods 
of [NSW85] do not apply, one must use vector fields where the error term is not 
obviously controllable. As shown in Section [5.2.41 (see also Section fl . 2 . 1 [) . the 
results of this paper imply that the error term is controllable. The point of this 
section is to offer an example where the methods of |NSW85| do not prove this 
fact. 

In particular, one needs that the error term of the Campbell-Hausdorff for- 
mula not be zero, so the main aspect of the example that follows is that the 
iterated brackets of the vector fields we present are not eventually zero (this 
rules out vector fields with polynomial coefficient q) . 

We work in the two-parameter situation, with A given by (|5.ip . We consider 
the list of vector fields on R 4 with formal degrees "generated" by the vector fields 

(ft + cos (s) ft, (1, 0)) , (ft + cos (x) ft, (0, 1)) . 

More specificly, we consider the list of vector fields with formal degrees: 

(ft + cos (s) ft, (1,0)), (ft + cos {x)d t , (0,1)) (sin (s) ft - sin (x) ft, (1, 1)) , 

(cos (a;) ft, (2,1)), (cos (s) ft, (1,2)), (sin (a;) ft, (3,1)), (sin (s) ft, (1, 3)) . 

It is immediate to verify that these vector fields satisfy the assumptions of 
Theorem 15. 3[ but (for the reasons mentioned above) the methods of [NSW85 
are insufficient to study the balls generated by these vector fields. 

5.2.4 Lifting results from the single parameter case and the Campbell- 
Hausdorff formula 

In this section, we discuss a general method whereby one may lift many results 
from the single parameter setting of [NS W85| to the multi-parameter setting in 
this paper. 

To make this methodology clear, we present a concrete example where it 
applies. Indeed, this example is interesting in its own right. 

We suppose that we are given generating C°° vector fields on C R n , with 
v parameter formal degrees, 

(W 1> d 1 ),...,(W r> d r ). 

For a word w ~ (wx, ...,w{), wj g {1, . . . ,r}, we define: 

i 

i=i 

24 As a consequence, if one is only interested in vector fields with polynomial coefficients, 
then the methods of INSW85| (with some adjustments) are sufficient for most purposes. 
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X w = ad (X Wl ) • • • ad (X W i-i) X Wl . 
Let (X, d) = (Xi,di) , . . . , (X g , d q ) denote the list of vector fields with formal 
degrees given by \X W , d(w)J where w — (wi, . . . , wi) and I < M for some fixed 
large M. Our goal is to show, under the smooth version of the hypotheses of 
Section UTTl that the balls 

B S d W (x) 

are comparable to the balls 

B (x,d) (x,S) . 

More specificly, fix xq € fi, and assume (X, d) satisfies C (xo, £)• We assume 
that we have, for every 5 € [0, l)" with S < £, 



[S^Xi^Xj] =J2<$>js d >x, 



on B(x,d) (^0; In what follows, an admissible constant may depend on upper 
bounds for q and n, lower and upper bounds for the | - j -j_ norms of the formal 
degrees, upper bounds for a finite number of the norms ll-^;'llc m (-B ( x d)(^o S)) 
and upper bounds for a finite number of the norms: 



sup £ Wx) a c^ 



\ a \S ni 



C«(B (Xid) (x ,5)) ' 



which we assume to be finite-in fact, we only need the above bounds for m 
which can be chosen to depend only on M and q. 
We have, 

Theorem 5.9. There exists an admissible constant r)' > such that for every 
5 < £, we have: 

B(x,d) (xo,v' s ) ^ B 5 d W (x ) C B {x ,d) (xq,S) . 



The second containment in Theorem 15.91 is obvious, and so the theorem is 
really a statement about the first containment. In the single parameter case, 
Theorem 15.91 was shown in NSW85 . Specificly, we have: 

Theorem 5.10 (Theorem 4 of NSW85 ). In the case v = 1 and when X\, . . . , X q 
span the tangent space, Theorem \5.9\ holds-so long as we allow admissible con- 
stants to also depend on a lower bound for: 

det X (xq) 

nxn 

Actually, in Theorem 4 of NSW85 , W\ , . . . , W r are each given the formal 
degree 1, but this is not an essential point, and the methods there immediately 
generalize to give Theorem 15.101 It is worth noting that the proof in [NSW85 
uses heavily the Campbell-Hausdorff formula, and therefore use of a lower bound 
for |det nx „ X {xq)\ is essential for those methods. 
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Proof of Theorem \5.9l Apply Theorem 15.31 to obtain $,5 , 771 and £2 as in that 
theorem. To prove Theorem 15.91 it suffices to construct an admissible constant 
r( > such that: 

B (X,d) (X0,V'8) Q B (£ 2 S) d W ( X o) 5 

rephrasing this, it suffices to show, 

B (<5<*x,£ d) (xo, rf) C B^ 2g y w (xo) , (5.10) 

for some admissible 77' > 0. Let Y denote the list of vector fields given by the 
pullback of S d X under the map &s to B no ^ (771), and let W denote the list of 
vector fields given by the pullback of S d W under $5. Pulling back (|5.10l) via 
$5, we see that it suffices to show that, 

%Ed)(0.'/)Cfl^ v ,(0). (5.11) 

However, using that the W generate the Y (since this is just the pullback of 
the statement that the W generate the X), using that \det no ^) xn (S) Y (0)| > 1 
(this follows from (|5.3p ). and using £ 2 ~ 1, we may apply Theorem l5.10l (in the 
special case when S s» 1) to deduce (15 . 1 1[) . completing the proof. □ 

In conclusion, if one can prove a result in the single-parameter setting of 
[NSW85], one often gets a multi-parameter result "for free," merely by pulling 
the multi-parameter vector fields back under the scaling map $5 and applying 
the single-parameter result. In particular, this allows one to use the Campbell- 
Hausdorff formula to prove results in the multi-parameter setting. This same 
proof method shows that the error term for the Campbell-Hausdorff formula as 
discussed in Section [l . 2 . 1 1 can be controlled in an appropriate sense, even in the 
multi-parameter setting. 

5.3 Control of vector fields 

In Section \4. 11 we saw that the conditions imposed on the commutators [Xi, Xj] 
in SectionSJwere closely related to three equivalent conditions that were defined 
in Section 14.11 (see Remark I4.20[) . The goal in this section is to understand the 
conditions imposed on the commutators in Section 15. II in a similar way. To do 
so, we will lift two of the three equivalent conditions from Section I4TT1 into the 
setting of Section 15.11 These equivalent conditions are interesting in their own 
right, and will play a role in future work. 

We take all the same notation as in Section 15. 1[ and define (m-) admissible 
constants in the same way. Let X q+ \ be a C 1 vector field on f2, with an as- 
sociated formal degree ^ d q+ i € [0,00)". We will introduce conditions on 
(Xq+i, dq+i) which will imply (informally) that one does not "get anything 

new" if (X q+ i,d q+ i) is added to the list (X, d). Let (^X,ctj denote the list 

of vector fields with formal degrees (Xi, d\) , . . . , (X q+ i,d q+ i). For an integer 
m > 1, we define two conditions (all parameters below are considered to be 
elements of (0,oo)): 
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1. VFiKuTuaua?): 

• V<5 e A,x e K, |det„ o(:(;i5 ) xno ( X; 5) (SX) (x)^ > m 

• Vx € K, detjxj X (x) = 0, no (x, 5) < j < n. 



det„ ( Xi5 ) x „ ( Xi 5) (SX^J (x) 



V5 eA,xe K^c!'*^ e C° (B {X4) (x,nS)) such that 



9+1 



[P'Xi, 6 d ^X q+1 ] = ^ X t , on (x, n<5) . 



with 

|a| <ro— 1 



C">(B (x , <J) (x,n,5)) 



< CT 1 



C°(B (x ,d)(x,-rL«)) 



< (71- 



2. (r 3 ,<T 3 ,a^): For every x eif,^i, there exist € C° (x ,t 3 (5)) 
such that: 

. 6 d <+*X q+1 = ELi on B (JCid) (x,t 3 <5). 



|a| <r 



(Or) a c*' 5 



C (S(X,«i)(x,T3<5)) 

C0(B( Xid) (x,r 3 5)) 



< 



CT 3 



< cr 3 . 



Theorem 5.11. "P™ V™ m i/ie following sense: 

1. V™ («i, ti, cti, cr™) =>■ t/iere exisi admissible constants r 3 = t 3 («i, n, cti), 
°"3 = C3 (ki,cti), and an m- admissible constant a™ = cr™ (k\, ct™) swc/i 
thatVfir^a^af). 

2. V™ (r 3 , cr 3 , cr™) i/iere exis£ admissible constants K\ — K\ (<t 3 ), <ti = 

CT i (03) an m- admissible constants™ — a™ (o - ™), suc/i thatF™ (ki, t 3 , o"i, <t™). 

Furthermore, if < dg +1 is a /ixed /ower bound for i/ien under the 

condition V™ (r^^a^^a^), we have that there exists an admissible constant t' = 
t' (dg +1 ,T 3 ,<7 3 ) smc/i iftat: 

(a.r'tf) C (x,r'<5) C B(x,d) (a, 73 J) 

/or every ieK,(SG^l. Finally, ifrj < r)\ is small enough so that & Xt s (B no ( Xt s) ("')) ^= 
B(x.d) (xo, T 35) and we define to be the pullback of 5 dq+1 X q+ \ under & x .s 

to B no{x j) {r}'), then, 



v x,5 



C"»(B B0( . l4) (i,')) 



< <7 4 



where a™ = a™ (c™) is an m-admissible constant. 
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Proof. Merely apply Theorem 14.171 and Propositions 14.181 and 14.191 for each 
x € K, 5 € A, to the list of vector fields (SX,^2d), taking xq — x and Jo = 
J (x,S). ^ □ 



Remark 5.12. Our assumption on the commutator in Section 15.11 was 

essentially that ([Xi, Xj] , di + dj) satisfied condition V™ for appropriate m. 

Definition 5.13. We say a vector field with a formal degree (X q+ i, d q +i) is m- 
controlled by the list of vector fields (X, d) provided either of the two equivalent 
conditions V™ or V™ holds. We say (X q+1 ,d q+1 ) is oo-controlled by (X,d) 
if V™ («i, n, o~±, fi™) holds for every m, with ki,ti, and o~\ independent of to 
(equivalently if T 3 ™ (T3, 03, tr™) holds with T3 and 173 independent of to). 

5.3.1 Examples of control 

For this section, we take all the same notation as in Section T5. 11 and assume that 
A is given by (|5.ip . As was mentioned in Section lS~51 fsee Remark [5. 12 1) our main 
assumption in Theorem l5.3l is essentially that the commutator ([Xi, Xj] , di + dj) 
is "controlled" by (X, d) in the sense of Definition 15.131 

In [NSW85 a stronger assumption was used in the single parameter case 
(see Section [1.2.1j) . The most obvious multi-parameter analog of this assumption 
is the following: 

[X h X j }= J2 <3 X ^ ( 5 - 12 ) 

where the inequality is meant coordinatewise, and the Cj ,• are assumed to be 
sufficiently smooth. It is easy to see that this assumption is a special case of 
the assumptions in Section [5. II indeed, one can take 

c k, x , s , = f')" " " <■::, if d k <di + d h 

l ' J I otherwise. 

One may wonder whether it is possible to get away with such simple assumptions 
in applications. This seems to not be the case, and to exemplify the possible 
difficulties, in this section, we give examples where the closely related notion of 
control takes a more complicated form. 

Example 5.14. This example takes place on K 2 with the vector fields: 

Xi = d x , X 2 = xd y , X 3 = d y . 
Create two copies of these vector fields: 

x l j x 2 , xi 

j = 1, 2, both copies acting on the same space. We take v — 2 and assign the 
formal degrees in [0, 00) 2 as follows: 

(Xl (1, 0)) , (Xl (1,0)) , (X 3 \ (2, 0)) , (Xl, (0, 1)) , (Xl (0, 1)) , (Xl (0, 2)) . 
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It is clear that: 

([A?,*!], (1,0) + (0,1)) =(dy,(l,l)) 

is oo-controlled by the other vector fields. However, it is easy to see that it 
cannot be written as in (|5.12j) . In this case, one could just throw in the vector 
field (d y , (1, 1)), and then the list of vector fields would satisfy (|5.12[) . but this 
is not the case in the Example 15. 161 below. Furthermore, this process of adding 
in vector fields is counter to the way in which we proceed in Section [6j 

Example 5.15. Consider the vector fields with single-parameter formal degrees 
on R given by: 

{d x ,2),(x 2 d x ,l),{xd x ,1.5). 

Denote them by (Xj,dj), j = 1,2,3. We restrict our attention to |a;| < 1. It is 
clear that for every |<5| < 1, 

[S**Xi, S^Xj] = c*j5 d *X k (5.13) 

k 

with c^'j £ C°° uniformly in 8. We claim that (xd x , 1.5) is oo-controlled by the 
other two vector fields. Indeed, fix xq,8, with 6, \xo\ < 1. By (|5.13[) it suffices 
to show that: 

\(6 2 ,x 2 6)\ oo >\(6 2 ,x 2 6, Xo 6 i - 5 )\ oo . ( 5 - 14 ) 

Suppose S 15 \xq\ > 8 2 . Then |xo| > VS. Hence, <5 15 |xo| < 6|xq|, completing 
the proof of l|5.14|> . 

What this example shows is that we can write 

6 15 xd x = c*°' s 5 2 d x + c% 0,5 Sx 2 d x , on B^ x ,d) ( x o, t 3 S) , 

where T3 can be chosen independent of xq, S. Note that the choice of c^ ' 6 , c^ ' 5 
depends on xq and S in a way which is more complicated than arises from (|5. 12[) : 
it depends on the ratio of \xo\ and V~5. 

Example 5.16. Consider the vector fields with formal degrees on R: 

(fit, (0,0)), (&, (0,6)) 

here a, b > 0. In this example, we take A = {0 7^ 5, \5\ < 1}. We will show that 
the above two vector fields oo-control (d x , (c,d)) if and only if the point (c,d) 
lies on or above the line going through (a, 0) and (0, b). Here, c, d are any two 
non-negative real numbers, at least one of which is non-zero. 

By replacing 5 — (61,62) with (df it is easy to see that it suffices to 

prove the result for a = 1 = b. Hence we need to show that (d x , (c,d)) is oo- 
controlled by the above two vector fields if and only if c + d > 1. However, it is 
easy to see that: 

{6i,6 2 ) (c ' d) <Cmzx{8 u 6 2 } 
for all 6 sufficiently small, if and only if c + d > 1 . The result follows easily. 
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6 Unit operators and maximal functions 



In this section, we wish to study maximal operators associated to a special case 
of the multi-parameter balls from Section 15.11 

Suppose we are given v families of C 1 vector fields on fi with associated 
single-parameter formal degrees: 

(X",d») = ((Xf ,<),..., 6(0, oo), l<^<u. 

We may associate to the X^s and d M s a family of vector fields with (multi- 
parameter) formal degrees. Indeed, let (X, d) denote the list of vector fields Xj, 
1 < fi < v, 1 < j < q^, with the degree of X^ the element of [0, oo) u which is 
dj in the /xth coordinate and is in all other coordinates. Define K <s fi and £ 
as in Section T5.2.21 in terms of (X,d). We assume that the list of vector fields 
(X,d) satisfies all of the assumptions of Section l5~T1 (with A given by (|5.1|) ). 
without adding any new vector fields to the list (X, d) o We define admissible 
constants in the same way as they were defined in Section 15.11 We define, for 
Se (0,1]", 5<£, x€K: 

B(xi-,d^),...,(X",d") (x,S) := B( X> d) (x,S) ■ 

We present two interesting examples that satisfy the hypotheses of this section: 

Example 6.1. Suppose X\,...,X m are C°° vector fields satisfy Hbrmander's 
condition. Use these to generate a list of vector fields with formal degrees 
(X,d) as in [NSW85] (see Section [T2~T]) . Let v = 2 and let (X,d) be the list 
of vector fields corresponding to fi = 1. Then let (<9i, 1) , . . . , (d n , 1) denote the 
list of vector fields corresponding to [1 = 2. I.e., fi = 2 corresponds to the usual 
Euclidean vector fields. These then satisfy the hypotheses of the section. The 
main idea in this example is that one may write: 

ix i ,d j ] = j24A> 

k 

where the af j 6 C°°. One must be careful with this example. It is tempting 
to think, given the results in Section I5.2.2[ that one could take any two lists 
satisfying Hormander's condition, removing the assumption that one of the lists 
corresponds to the usual Euclidean vector fields. This is not the case, since the 
procedure in Section [5.2.21 involved adding more vector fields to the list (X, d), 
namely the commutators involving vector fields from both lists. In this case, 
though, this procedure is not necessary, due to (|6.ip . 

Example 6.2. Let (X^,d^) be v lists of vector fields with formal degrees such 
that for each fixed /i, (X 11 , d^) satisfies the hypotheses of Section I5~T1 (with v = 1 
and A given by (|5.ip ). Suppose further that for ^ [X^fXj 2 ] = 0. 
Then these vector fields satisfy the hypotheses of this section. In particular, 

25 In particular, this implies that the (one-parameter) list of vector fields (Jf",^) satisfies 
the assumptions of Section [5.1l for each /Ltrj . This can be seen by taking 8^=0 for every fi ^ /iq . 
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when working on a homogeneous group, one could take v — 2, and let the /i = 1 
vector fields correspond to a homogeneous basis of the left invariant vector fields 
(with degrees corresponding to their homogeneity) and /j = 2 be a similar list 
but instead with the right invariant vector fields. This was the setup in |Str08j . 
and is discussed in Section n.2.41 

To motivate the results in this section, let us consider a classical example. 
In this case Q, = W, = 1 for all 1 < fj, < v, and (Xf , d£) = (d^, 1). We have 
the classical "strong" maximal function in M?. This is given by: 

Mf(x) := sup — — — — y I \f(z)\dz. 

«=(*!,...,*„) Vol (%,i),.„,( 8 „,i) {X,6)) JBiSj i) ... (b„,i) (k,<5) 
Sj>0 

(6.2) 

Rewriting (|6.2|) in the notation of Section |4j we have: 

Mf (x) = supA S(8ijl)i ... i(8i/ ,!)(.,«) |/| (a;) . 

Perhaps the easiest way to deduce L p boundedness (1 < p < oo) for M. is 
the idea of Jessen, Marcinkiewicz, and Zygmund JMZ35 to bound M. by a 
product of the one-dimensional maximal functions, whose L p boundedness is 
already understood. To do this, one proves the simple inequality, that there 
exists a A > such that for every S, and every / > 0, we have: 

A B {aiA) ( a„,i,(-,A5)/ < CA B{guA)[ . M ■ ■ ■ Ab (9i 1)( . A) / 

And then it follows immediately, that: 

Mf(x)<CMu---M 1 f(x), (6.3) 

where 

M fl f{x) = sup A B (. >Sli) |/|. 
In this section, we wish to generalize (|6 . 3|) . 

Theorem 6.3. There exist admissible constants, < ti < t\ < 1, a > such 
that for all \S\ < a, we have, for f € C (f2), / > 0, x G K: 

A B ix x, d i ) ,...,c X ,, dn i;T2S)f(x) < A B{XV dU){ .^ lS) ■ ■ ■ A B{xl dl)[ . tTlS) f (x) 

~ A B {xl , d i } ,..., (x „, dn {;6)f(x)- 

Proof. Apply Theorem H2H with {Z^,d^ L ) = (5 fl X^,d 1 '), (X,d) = (8X,d), and 
xq = x, where x € K . We obtain admissible constants Ai, A2, A3 independent of 
x, 5 as in that theorem. 

To conclude the proof, merely take t\ — j 2 -, r 2 = and replace 5 with 
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Corollary 6.4. There exist admissible constants < T2 < T\ < 1, a > sitc/i 
£/ia£ £/ we define, for x G A", / € C (f2), 



X/ (as) = sup ^s (xl , dl) ,..., (x „, d „ ) (,5) l/l (a 

|(5|<r2cr 



and /or all x € fl such that B(x^,d^) { x i Tier) C O, 

AW 0*0 = SU P A s (X m i(1) (.,<V) l/l («) ■ 

0<5 M <T 1( T 

Then we have: 

Mf{x)<M v M v -x---Mxf{x). (6.4) 
Proof. This follows directly from Theorem 16.31 □ 



Corollary 6.5. Let M be defined as in Corollary \6.J\ Then, by possibly admis- 
sibly shrinking T2, we have that M. extends to a bounded map L p (fi) — » LP {K), 
for every 1 < p < 00 . 



Proof. This would follow from (|6 .4[) , provided we have that VW 1 , . . . , A4 „ extend 
to bounded operators on LP (1 < p < 00). Intuitively, this is simple, since the 
one-parameter balls B^x^.d^) (")<^) satisfy the doubling condition (|5.5I) . and we 
expect to be able to apply the theory of spaces of homogeneous type to conclude 
the desired L p boundedness. There is a slight technicality, though, since if (for 
a fixed (1), the vector fields do not span the tangent space, then the balls 
B(x»,d») ('j^m) d° n0 ^ en dow with the structure of a space of homogeneous 
type: rather, they foliate f2 into leaves, each of which is (locally) a space of 
homogeneous type. The technical details to deal with this difficulty are covered 
in Section El □ 



Remark 6.6. Notice, in Corollary I6.5[ we have left out p — 00. This is because 
if the vector fields do not span the tangent space, the maximal operators may 
only be a priori defined on C (57). 

Remark 6.7. As mentioned in Section fl. 2 .41 it is likely that M. is bounded on 
LP (1 < p < 00) for a larger class of balls than is discussed in this section. 
However, Theorem 16.31 is very tied to the assumptions of this section. 



Corollary 6.8. There exists an admissible constant a\ > such that if for 
\5\ < o~i, we let Tg denote the Schwartz kernel for the operator: 

then, for x G K , T$ (x, y) is supported on those points (x, y) such that: 

inf {r > : y G B ( xi,d 1 ),-,(X»,cr) ( x , tS )} £ h (6-5) 

and moreover, 

supTs(x,y)~ — . (6.6) 

y Vol (B( X \d 1 ),...,(X",d'') OMJJ 
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Furthermore, there exists an admissible constant 02 > such that: 

T s (x, y) « - , r^vT' y e (a, cr 2 S). 

Yo\{B {x i 4 i h „^ {X u^ ) {x,8)) 

(6-7) 

These inequalities are understood to be taking place on the leaf generated by 6X 
passing through x. 



Proof. (|6.5p and the < part of (|6.6p follow by applying Theorem 16.31 and using 
the inequality (for / > 0): 

A B ( x^,^)l'.n«) ' ' ' A B {xl _ dl) (;r 1 s)f (x) < A B{xl dl) {XV dU) (.,s)f (ar) 
and renaming t\6, 6. With this new 6, the other half of Theorem 16. 31 now reads: 

thereby establishing (|6.7|) and therefore the > part of (|6.6[> . This completes the 
proof. □ 



Corollary 16.81 has an interesting corollary, to which we now turn. For the 
statement of this corollary, we restrict our attention to the case v = 2, but 
otherwise keep the same assumptions and notation as in the rest of the section. 

Corollary 6.9. Suppose the leaf generated by X 1 passing through xq is the same 
as the leaf generated by X 2 passing through xq (call this common leaf L). Then, 
there exists an admissible constant o\ > such that for every xq € K and every 
5 := (61,62) with \6\ < o\, we have: 

,.,/„ f , .y, Vo\(B {x ^ d i ) (xo,6 1 ))Vol(B iX 2^ d 2 ) (x ,6 2 )) 
Vol (x , 61) n B {X 2 d 2 } (x ,S 2 )) ps - i . 

Here, Vol(-) on the left hand side denotes the induced Lebesgue volume on L. 

Proof. In the following, dy will denote the induced Lebesgue measure on L, and 
for a set A, \A will denote the characteristic function of A. 



Vol (-B(xvi) Oo, Si) n B (X 2 4 2 } (x ,S 2 )) = J XB {xlidl) {x a ,Si) (v) XB {x2 _ d2) ( y ,s 2 ) ( x o) dy 
= Vol (5 (x vi) (x ,6i)) Vol (B(x=,(p) (so,^)) 

^-^yXs (xlidl) (*oA) (y) vol (S (x2 , d2) (x ,^)) XB (- 2 - 2 ) ( ^ 2) (Xo) dV ' 



Vol (-B (X i id i) 



In the above, we have used that y G £?nc 3 ,d 2 ) (^o,^) if and only if xo € 
B( X 2 : d 2 ) {Vi 62)- We use the fact that if ?/ <E B( X 2 d 2^ (xo,^), then 

Vol (B( X 2 A 2) (x , 6 2 )) « Vol (S(x2, d 2) (y, <5 2 )) , 
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which follows from Theorem 15.31 in particular (|5.5p . We then have that the 
RHS of (EH is: 



~ Vol (x ,Si)) Vol (B (x *,d 2 ) (x , 8 2 )) 

f 1 1 

= Vol (B^x^d 1 ) (xo,Si)) Vol (B [x *,d?) (xq,5 2 )) T s (x ,x ) ■ 



Where Ts is as in Corollary 16.81 Now the result immediately follows from 
Corollary EH □ 



6.1 Some comments on metrics 



Corollarv l6 . 81 has a corollary which can be phrased in terms of metrics, and may 
serve to give the reader some intuition for these results. We devote this section 
to this corollary, and maintain all the same notation as in Section [5] 

Fix r — (ri, . . . , rj) £ (0, 1]", and assume = 1 for some \i. Corresponding 
to each such r, we obtain a one-parameter family of balls: 

B (x,d) (x,Sr) , 

for a: e ft. This one-parameter family of balls is associated to the Carnot- 
Caratheodory metric p r , associated to the vector fields 

{ (r? X?, d£) : 1 < M < v, 1 < 3 < ■ 

This metric is defined by: 

Pr (x,y) := inf {6 > : y £ B {x ,d) (x,Sr)} . 

Remark 6.10. Actually, it could be that p r is not a metric, in that if the Xj do 
not span the tangent space, the distance between two points might be 00. This 
will not affect any of the results in this section. 

Remark 6.11. We assumed that max M = 1 since, if we drop this assumption, 
we have: 

5Pr = PS- 1 r 

and so every choice of r can be reduced to the case when max p r M = 1. 

For each fi we also obtain a metric, the Carnot-Caratheodory metric associ- 
ated to the vector fields 

{(X?,d?):l<j< q »} 

given by 

(x,y) ■= inf {8 > : y £ B( X »,df) OM)} , 
where we have the same caveat as in Remark l6.10l 
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Given two functions Ai, A2 : fi X O — > [0, 00] (one should think of Aj., A2 as 
metrics) we obtain a new function: 

(Ax o A 2 ) (as, z) := inf Aj (x, y) + A 2 (y, 2) . 

One should think of Ai o A 2 as the "distance" between x and z if one is first 
allowed to travel in the Ai metric and then in the A 2 metric. Of course, even if 
Ai and A 2 are metrics, Ai o A 2 may not be symmetric, and therefore will not 
be a metric. 

However, in the case of the p^ above, we do end up with a quasi-metric. 
Indeed, we have: 

Corollary 6.12. There is an admissible constant er 2 > such that for every 
x € K and y £ Q, such that p r (x, y) < cr 2 , we have: 

[(/TVi) (r 2 V2) • ■ ■ (r^Pv)] (x, y) w p r {x, y) . 

Proof. > is obvious. < follows from Corollary 16. 81 □ 

Remark 6.13. When [Xj 11 , X^ 2 ] = for p,\ ^ p 2 (and with a slight modification 
in the definition of our metrics), one actually obtains equality in Corollarv l6.12l 
(as opposed to «). This follows from the proof method in Section 4.1 of [Str08 . 



6.2 Foliations whose leaves are locally spaces of homoge- 
neous type 

Let (Xi, dx) , - • • j (X q , d q ) be vector fields on an open set C M. n with single- 
parameter formal degrees d\, . . . , d q € (0, 00). Under the (single-parameter ver- 
sion of) the hypotheses in Section UTTl the balls B^ x ,d) ( x , 5) satisfy the doubling 
property that is crucial to the theory of spaces of homogeneous type: 

Vol (B {X:d) (x Q ,2S)) < Vol (B {x>d) (x , 6)) , (6.9) 

see (|5.5p . This leads one to consider the maximal operator given by, 

Mf(x) = supA B { s) l/l (x) = sup 1 / |/(y)| dy, 

<5>0 ' ■ ' 5>0 Vol {B( X ,d) {X,d)) JB ix>d) {x,&) 

where the supremum is only taken over 5 sufficiently small. If the vector fields 
X\, . . . , X q spanned the tangent space at every point of fl, the balls B( X .d) (x, 6) 
would be open sets of positive Lebesgue measure and (|6.9I) would imply that 
they do, in fact, endow f2 with the structure of a space of homogeneous type. 
Classical arguments then show that Ai extends to a bounded operator on L p 
(1 < p < oo)-this is essentially the situation covered in |NSW85j . 

However, if the vector fields do not span the tangent space at each point, then 
the balls B( X ,d) ( x , S) do not turn f2 into space of homogeneous type. Indeed, at a 
point xo where X\, . . . , X q do not span the tangent space, the ball B(x,rf) (xo, 5) 
does not even have positive n dimensional Lebesgue measure: it lies on the leaf 
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passing through xq generated by X\,...,X q . This does not prevent M from 
extending to a bounded operator on L p (1 < p < oo) however, as we shall 
see. Informally, the idea is that X\, . . . , X q foliate f2 into leaves, where each leaf 
(endowed with the induced Lebesgue measure) is locally a space of homogeneous 
type, and the standard theory of maximal functions may be applied. It is crucial, 
here, that we are working locally (i.e., that we are restricting our attention to 
5 > small). If we had not restricted our attention to local results, the space 
of leaves might be quite complicated, to the extent that it would be difficult (if 
not impossible) to lift the L p boundedness of M from each leaf to f^F^l 

Remark 6.14. Near a non-singular poiniF^I of the involutive distribution spanned 
by Xi, . . . , X q , the boundedness of M. follows immediately. Indeed, in this case, 
the foliation looks locally like a product space. The maximal function just acts 
on one of the product variables, and in this variable the balls form a space of 
homogeneous type. Thus, the main point of this section is to demonstrate an 
easy way to deal with singular points; however, it will not be necessary for us to 
make any distinction between non-singular and singular points in our argument. 

We now turn to a formal statement of our results. We are given a compact 
set K <s £1, and £ > such that (X,d) satisfies C (x, £) for every x £ K. We 
assume for every 5 < £, x E K, we have: 

k 

on B/x,d) ( x >$)- I n addition, we assume: 

• ThcX,sareC 2 on B( X ,d) i x ,Q> for every a; € K, and satisfy sup xeK \\ x j\\ c ^(b {x d) (x,o) < 
oo. 

• For all \a\ < 2, x € K, we have (S d X) a c*f x € C° (B (x ,d) (x,S)), for 
every k, and every S E A, and moreover: 

sup V \\(6 d X) a c*f x , <oo. 

Finally, let 

n (x,S) = dimspan{<5 dl Xi (x) ,S dq Xq (x)}. 

We say C is an admissible constant if C can be chosen to depend only on 
fixed upper and lower bounds d max < oo, d m i n > 0, for d\,...,d q , a fixed upper 
bound for n, q and a fixed upper bound for the quantities: 

supj^ll^^), ^^Wjt) a ^\ 

2e Consider, for instance, the vector field d x + Qd y on the manifold M = R 2 /Z 2 , where 
9 S M \ Q. In this case, if we denote by C the space of leaves, we have L p (£) = C. Locally, 
however, M with this foliation just looks like a product space, and the corresponding maximal 
function is just the standard maximal function along one of the variables. 

27 Xq S f2 is said to be a non-singular point if dimspan{Xi (a;) , . . . ,X q (x)} is constant in a 
neighborhood of xq. 
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Theorem 6.15. There exists an admissible constant £' > 7 £ ' < £, smc/i that 
if we define, for f € C (O), x <E K , 

Mf(x) = sup A B(Xd)M) |/| (z) , (6.10) 

o<<5<e 

tten /or every / G C (il) fl L p (O), 

/or every 1 < p < oo . Here, C p is an admissible constant which may also depend 
on p. 

The main assumptions of this section are equivalent to saying that Theorem 
15.31 applies to the vector fields (X, d). Let £i,»ji be admissible constants as in 
the conclusions of Theorem 15.31 Define, 

«" = U (*■ |) ■ 

xeK ^ ' 

Theorem 16.151 will follow from the following two propositions. 

Proposition 6.16. There exists an admissible constant £o > 0, £o < C> such 
that for every £' < £o o,nd every f € C (CI) with / > 0, we have: 

[ f(x)dx<[ A B Ul) f(x)dx< [ f(x)dx, 

where the implicit constants are admissible but also allowed to depend on a lower 
bound for £' . 

Proposition 6.17. We have the pointwise bound, for 1 < p < oo, < £' < 
and x € f2": 

where the implicit constant is admissible and can also depend on p and a lower 
bound for but not on x. 

Proof of Theorem \6.15\ given Propositions 1 6. 1~6\ and \6.17\ Fix p > 1. Let £o be 
as in Propositions 16.161 Fix £' > an admissible constant, such that £' < 
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Take / £ C (ft) n U> (Q), and consider: 



\\Mf\\ p LP{K) = / 



A" 



< 



< 



< 



<\\f\ 



A B{X ^)\Mf\ p {x) dx 

A B( X , d) (;2?) UTO) dx 

\f(x)\ p dx 
p 

Lp(Q.) ' 



completing the proof. □ 
We now prove Proposition 16. 161 To do so, we need two lemmas. 

Lemma 6.18. There exists an admissible constant rp > such that for every 
< rj' < rp , and every f £ C (f2) with f > 0, we have: 



f (x) dx 



< 



<K {'^ r l') q J\t\<j]' J ft" 

Proof. Note, for \t\ < we have 

n' 2 e t x n" d k. 

To make use of this, we choose rf < Furthermore, by taking rp > 
admissible small enough, we have for all |t| < rp , and all x £ ft" , 



f (e t - x x) dxdt< f (x) dx 



det-^-e tx x 



> 



This follows since the C 2 norm of e t x x — x is admissibly bounded (see Theorem 
lA.ip and because when t — 0, e t ' x x = x. 

Putting these results together, we have from a simple change of variables, 
for \t\ < rp, 



f (x) dx 



< 



K 



f (e t - x x) dx< f (x) dx. 



Averaging both sides over \t\ < rj' yields the proof. 



□ 



Lemma 6.19. Let rp be as in Lemma \6.18\ There exists an admissible constant 
£o > such that for every < £' < £q, there exist admissible constants < rj" = 
r\" (£'), < r/', n" ', rj' < rp such thal^ for every f £ C (fl), f > 0, we have: 



\t\<v" 



f{e tx x) dt<A B(x>d)UI) f(x) 



< 



f (e* X x) dt, 



\t\<v' 



28 Here, rj" can be chosen to depend only on a fixed lower bound for f. 
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for every x € fi'. Here, the implicit constants are allowed to depend on a lower 
bound for £' . 

Proof. This follows just as in Proposition 14.221 The straightforward modifica- 
tions are left to the reader. □ 

Proof of Proposition \6.16\ Take 77°, £0 as in Lemmas 16. f 81 and 16. f 91 For £' < £0, 
let 77', 77" be as in the conclusion of Lemma lB.l 91 (here and in the rest of the proof, 
all constants are allowed to depend on a lower bound for so that, in particular, 
rj" > 1). We then have, using Lemmas 16. 181 and 16.191 freely, for f E C (O), with 
/>0, 

r f (x) dx < [ [ f (e t - x x) dx dt 
k J\t\<"n" Jw 



< 



V." 



A B ( x,d)(-,f')/ ( x ) dx 



< / / f(e tx x) dxdt 

J\t\<-q' JO." 



< 



f (x) dx, 



which completes the proof. □ 

Proof of Proposition \6.17\ Fix x £ Jl" and £' as in the statement of the propo- 
sition. In what follows all implicit admissible constants are also allowed to 
depend on a lower bound for We define the maximal function M. in terms 
of this fixed as in (|6.10j) . By definition of SI", there exists xo € K such that 
B(x,d){x,C) Q B(x t d)(xo,£i). Let n = dimspan{Xi (x ) , . . . , X q (x )}, and 
let $ : B no (r/i) — > B( X ,d) i x o,£) be the map guaranteed by Theorem 15.31 where 
we take 5 = £ and x = Xq. Note that, 

B ( x,d) M) C B (XA) (x ,£i) C $ (B„ (770) . 

Let Y\, . . . , Y q be the pullbacks of X\, . . . , X 9 via the map 3>, to B no (77). 
Note, for u € i?„ (77) and 5 > small enough that B(y,d) (u,S) <e _B„ (77), we 
have 

$(B {Y4) (u,S))=B (Xtd) (*(«),*). 

Using that |det„ oX n d$ (u)| « |det„ oX n X (xo)|, and applying a change of vari- 
ables as in (|B.2|) . we see that 



Vol (B (X4) (*(u),*)) 



det X (xo) 

n xn 



Vol {B {YA (u,S)) 
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It follows, for / e C (fl), with / > 0, 



A B ( . tS) (fo§ X ) ($ (U)) = — y- \ . , 

Vol [B {x>d) ($ (u) , J)) 7 B(X d) (4>(u)i(5) 



1 



Vol (B(Y, d ) (U,5)) J BiY , d) (u,5) 



/(«) rf« 



where, again, we have used (IB.2[) and <ii> denotes Lebesgue measure on B no (77) 
and c?y denotes the induced Lebesgue measure on the leaf in which Brx,d) i u ) 1 

lies. Consider, for y 6 B^x.d) (- T o, 4f ^ , 

-M(/°* _1 )(t/) = sup A fl I/O*" 1 ] (y) 

c>5>o 

« sup A B ( S ) I /I (j/)) 

£'><5>0 

=: M/ (S" 1 (y)) , 

where 7W denotes the maximal function defined in terms of the Carnot-Caratheodory 
balls defined by the vector fields ( Y, d) . 
Hence, we have, 



A, 



■(X,d) 



{ , e) \M(fo$-i)\ p (x)*A B . x> , e) 



A 



Mf 
Mf 



H 1 

o 

(X)) 



(,;) 



Similarly, we have 

A B(xM .,2e) \f° *~T (*) « l/l p (sr 1 (*)) • 

Thus, to complete the proof, it suffices to show the bound 



.4 



Mf 



(^(x))<A BlvM , 26l) \f\ p (9-Hx)) 



Moreover, since Vol (B {Y4) (»,£')) ~ Vol (B {YA) (fc" 1 (x) ,2?)) (in fact 
both are « 1, but we will not need this), it suffices to show 



Mf 



< 



ll/ll' 



Lp(B {Y:d) (<S>-i(x),2(,')) 



This is immediate from the classical theory of spaces of homogeneous type, 
since the balls Bry<j) {'■,$) satisfy all the axioms of a space of homogeneous 
type, uniformly in the relevant parameters. □ 



78 



A Two results from calculus 



In this appendix, we discuss two theorems from calculus that we will use through- 
out the paper: a uniform version of the inverse function theorem, and how the 
smoothness of e tlXl+t2X2 ~ { "Xq, as a function of t\, . . . , t v , depends on the 

smoothness of X\, . . . ,X V . These results are surely familiar, in some form or 
another, to the reader. However, they play such a fundamental role in our anal- 
ysis, that we feel it is prudent to state them in the precise form we shall use 
them. 

For a C 1 vector field Y, one defines E (t) — e tY xo to be the unique solution 
to the ODE j- t E{t) = Y (E (i)) satisfying E (0) = x . This unique solution 
always exists for \t\ sufficiently small (depending on the C 1 norm of Y). This 
allows us to define: 

e t lXl+ ... +tvXvxo 
for \t\ sufficiently small, where t = (ti, . . . , t v ). We have: 

Theorem A.l. Suppose X\, . . . ,X V are C m vector fields (m > 1), defined on 
an open set fi C K™. Then, for xq fixed, the function: 

u{t) = e tlXl+ - +t » x »x - x 

is C m . Moreover, the C m norm of this function can be bounded in terms of n, v 
and the C m norms of X\, . . . , X v . 

Proof. It is perhaps easiest to consider the function: 

Then, u (t) — v(l,t) — xq, and v is defined by an ODE in the e variable. 
That v — x Q is C m (in both variables) is classical. See Die60 , Chapter X. 
Alternatively, one can modify the proof method in |Izz99j to this situation for 
a more elegant proof. □ 

Remark A. 2. One could write t in polar coordinates r, ui, and consider the 
function, f(r,uj) — u(ruj). Then, one has, / G C m (r, uj). Moreover, one has, 
for a + \b\ — m (a G N, b a multi-index), 

d r d a r d h J (r,w) 

exists and is continuous. 

We now turn to the inverse function theorem: 

Theorem A. 3. Fix an open set U C R n , and fix xq G U. Suppose K C 
C 1 {U ; R n ) is a compact set such that for all f £ K , det df (xq) ^ 0, and hence, 
\detdf(xo)\ is bounded away from uniformly for f G K. Then, there exist 
constants 81,62 > 0, such that for all f G K , 

• /Ib(x .i5i) * s a C 1 diffeomorphism onto its image. 
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• B(f(xo),5 2 )Cf(B(x ,S 1 )). 



here, B (xq,8) denotes the usual Euclidean ball centered at xq of radius S. 

Proof. This follows from a straight-forward modification of the proof in | Spi65 , 
by using the Arzela-Ascoli theorem. Alternatively, since in our proofs, we will 
always show that the relevant set is a pre-compact subset of C , by showing it is 
a bounded subset of C 2 , the derivatives of the functions in our set will actually 
be uniformly Lipschitz, and in this case one may use the theorem in [HH99 . □ 



B The Cauchy-Binet formula 

In this appendix, we review the Cauchy-Binet formula and an associated change 
of variables formula, that is essential to the work in this paper. We first recall 
some notation from Section ll.il given two integers 1 < m < n, define Z(m, n) 
to be the set of all lists of integers . . . , i m ), such that: 

1 < i\ < «2 < • • • < 4 < n. 

Given an n x q matrix A, and uq < n A q, for / £ Z(no,n), J £ X (jiQ,q) 
define the no x uq matrix Aij by using the rows from A which are listed in / 
and the columns from A listed in J. We define: 



det A 

n xn 



(det A/,j) /eX( 

no ,n) 
JeX(n ,q) 



In particular, det„ oX n ^ is a vector (it will not be important to us in which 
order the coordinates are arranged). 

A special case arises when q = n . Indeed, in this case, we have QTT57j . p. 
127): 



det A 

n xn 



= Vdet A* A 



(B.l) 



and both of these quantities are equal to the volume of the no dimensional 
parallelepiped with edges given by the columns of A. This is a special case 
of the Cauchy-Binet formula. Because of this, we obtain a change of variables 
formula which will be of use to us. Suppose $ is a C 1 diffeomorphism from an 
open subset U in R n ° mapping to an no dimensional submanifold of W 1 , where 
this submanifold is given the Lebesgue measure, dx. Then, we have: 



<£>([/) 



/ (x) dx 



/(*(*)) 



det (d$(t)) 

noX.no 



dt. 



(B.2) 
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